Abstract

We bound the j-invariant of S-integral points on arbitrary modular curves over arbitrary 
fields, in terms of the congruence group defining the curve, assuming a certain Runge condition 
is satisfied by our objects. We then apply our bounds to prove that for sufficiently large prime 
p, the points of $X_0^+(p^r)(\mathbb{Q})$ with $r > 1$ are either cusps or CM points. This can be interpreted
as the non-existence of quadratic elliptic Q-curves with higher prime-power degree. 
1 Introduction

Let $N > 2$ be an integer and $X(N)$ the principal modular curve of level $N$. Further, let $G$ be a
subgroup of $\mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z})$ containing $-1$ and let $X_G$ be the corresponding modular curve. This curve
is defined over $\mathbb{Q}(\zeta_N)^{\det(G)}$, so in particular it is defined over $\mathbb{Q}$ if $\det(G) = (\mathbb{Z}/N\mathbb{Z})^\times$. (Throughout
this paper, we say that an algebraic curve is defined over a field if it has a geometrically integral
model over this base.) As usual, we denote by $Y_G$ the finite part of $X_G$ (that is, $X_G$ deprived of
the cusps). If $X_G$ is defined over a number field $K$, the curve $X_G$ has a natural (modular) model
over $\mathcal{O} = \mathcal{O}_K$ that we still denote by $X_G$. The cusps define a closed subscheme of $X_G$ over $\mathcal{O}$, and
we also call $Y_G$ the relative curve over $\mathcal{O}$ which is $X_G$ deprived of the cusps. If $S$ is a finite set
of places of $K$ containing the infinite places, the set of $S$-integral points $Y_G(\mathcal{O}_S)$ consists of those
$P \in Y_G(K)$ for which $j(P) \in \mathcal{O}_S$, where $j$ is, as usual, the modular invariant and $\mathcal{O}_S = \mathcal{O}_{K,S}$ is
the ring of $S$-integers.

In its simplest form, the first principal result of this article gives an explicit upper bound for
$j(P) \in \mathbb{Z}$ under a certain Galois condition on the cusps.

Theorem 1.1 Assume that $X_G$ is defined over $\mathbb{Q}$, and assume that the absolute Galois group
$\mathrm{Gal}(\bar{\mathbb{Q}}/\mathbb{Q})$ acts non-transitively on the cusps of $X_G$. Then for any $P \in Y_G(\mathbb{Z})$ we have

$$\log|j(P)| \le 12|G|N^2\log 3N. \tag{1}$$
This result was announced in [1]. Because of an inaccuracy in the proof given in [1], the
log-factor is missing therein (this, however, does not affect the arithmetical applications of that
theorem, which, by the way, has since been drastically improved in [2], see below).

Actually, we obtain a more general Theorem 1.2 below, which applies to any number field and
any ring of $S$-integers in it. To state Theorem 1.2 we need to introduce some notation. We denote
by $h(\cdot)$ the usual absolute logarithmic height (see Subsection 1.1). For $P \in X_G(\bar{\mathbb{Q}})$ we shall write
$h(P) = h(j(P))$. For a number field $K$ we denote by $\mathcal{C} = \mathcal{C}(G)$ the set of cusps of $X_G$, and by
$\mathcal{C}(G,K)$ the set of $\mathrm{Gal}(\bar{K}/K)$-orbits of $\mathcal{C}$.

Theorem 1.2 Let $K$ be a number field and $S$ a finite set of places of $K$ (containing all the
infinite places). Let $G$ be a subgroup of $\mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z})$ such that $X_G$ is defined over $K$. Assume
that $|\mathcal{C}(G,K)| > |S|$ (the "Runge condition"). Then for any $P \in Y_G(\mathcal{O}_S)$ we have

$$h(P) \le 36\,s^{s/2+1}\left(N^2|G|/2\right)^s \log 2N, \tag{2}$$

where $s = |S|$. If $S = M_K^\infty$, we even have

$$h(P) \le 24\,s^{s/2+1}\left(N^2|G|/2\right)^s \log 3N. \tag{3}$$

Theorem 1.1 is a special case of this theorem.
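
To see how Theorem 1.1 sits inside Theorem 1.2, one can evaluate the right-hand sides numerically. The sketch below is only an illustration: it reads the bounds (2) and (3) as $36\,s^{s/2+1}(N^2|G|/2)^s\log 2N$ and $24\,s^{s/2+1}(N^2|G|/2)^s\log 3N$, and the function names and the parameter `order_G` (standing for $|G|$) are our own hypothetical choices, not notation from the article.

```python
import math


def runge_height_bound(N: int, order_G: int, s: int, archimedean_only: bool = False) -> float:
    """Right-hand side of (2), or of (3) when S consists of the infinite places only.

    N is the level, order_G stands for |G| and s for |S| (illustrative names).
    """
    common = s ** (s / 2 + 1) * (N * N * order_G / 2) ** s
    if archimedean_only:
        return 24 * common * math.log(3 * N)
    return 36 * common * math.log(2 * N)


def theorem_1_1_bound(N: int, order_G: int) -> float:
    """Right-hand side of (1): 12 |G| N^2 log(3N)."""
    return 12 * order_G * N * N * math.log(3 * N)
```

For $K = \mathbb{Q}$ and $S$ reduced to the single infinite place we have $s = 1$, and `runge_height_bound(N, order_G, 1, archimedean_only=True)` agrees with `theorem_1_1_bound(N, order_G)`.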

This theorem is proved in Section 5 by a variation of the method of Runge, after some
preparation in Sections 2, 3 and 4. For a general discussion of Runge's method see [3, 12].

Theorems 1.1 and 1.2 apply for a general group $G$. For a specific $G$, one may obtain much
stronger results. One such example can be found in [2]. In Section 6 of the present article we
apply our general strategy in the case when $N = p$ is a prime number, the field $K$ is quadratic
and $G$ is a split Cartan subgroup; in this case we obtain a much sharper estimate for the integral
points than what follows by direct application of Theorem 1.2. In Section 7 we apply this result
to the arithmetic of modular curves, using the integrality property established in Section 8.

Theorem 1.3 There is an absolute constant $p_0$ such that, if $r > 1$ and $p > p_0$, the set $X_0^+(p^r)(\mathbb{Q})$
is made of cusps and CM points. Equivalently, there is no quadratic $\mathbb{Q}$-curve of prime power
degree $p^r$ with exponent $r > 1$ when this prime is large enough.

Recall that a $\mathbb{Q}$-curve is an elliptic curve without complex multiplication over $\bar{\mathbb{Q}}$ which is
isogenous to each of its conjugates over $\mathbb{Q}$. When such a curve is quadratic (that is, defined over a
quadratic field), we say it has degree $N$ if the minimal degree of an isogeny from the curve to its
only non-trivial conjugate is $N$. For concrete examples of quadratic $\mathbb{Q}$-curves see [7] and references
therein.

Theorem 1.3 extends the principal result of [5] and essentially settles the problem tackled
in [18], [15] or [21].



1.1 Notations, Conventions 

Everywhere in this article $\log$ and $\arg$ stand for the principal branches of the complex logarithm
and argument functions; that is, for any $z \in \mathbb{C}^\times$ we have $-\pi < \operatorname{Im}\log z = \arg z \le \pi$. In Section
2 of this article we shall systematically use, often without special reference, estimates of the
kind

$$|\log(1+z)| \le \frac{|\log(1-r)|}{r}\,|z|, \qquad |e^z - 1| \le \frac{e^r - 1}{r}\,|z|,$$

$$\left|(1+z)^A - 1 - Az\right| \le \frac{\left|(1+\varepsilon r)^A - 1 - \varepsilon A r\right|}{r^2}\,|z|^2 \qquad (A \in \mathbb{R},\ \varepsilon = \operatorname{sign} A), \tag{4}$$

etc., for $|z| \le r < 1$. They can be easily deduced from the maximum principle.

Let $\mathcal{H}$ denote the Poincaré upper half-plane: $\mathcal{H} = \{\tau \in \mathbb{C} : \operatorname{Im}\tau > 0\}$. For $\tau \in \mathcal{H}$ we put $q_\tau = e^{2\pi i\tau}$
(or simply $q$ if there is no ambiguity). We put $\bar{\mathcal{H}} = \mathcal{H} \cup \mathbb{Q} \cup \{i\infty\}$. If $\Gamma$ is the pull-back of
$G \cap \mathrm{SL}_2(\mathbb{Z}/N\mathbb{Z})$ to $\mathrm{SL}_2(\mathbb{Z})$, then the set $X_G(\mathbb{C})$ of complex points is analytically isomorphic to the
quotient $X_\Gamma = \bar{\mathcal{H}}/\Gamma$, supplied with the properly defined topology and analytic structure [11, 22].

We denote by $D$ the standard fundamental domain of $\mathrm{SL}_2(\mathbb{Z})$ (the hyperbolic triangle with
vertices $e^{\pi i/3}$, $e^{2\pi i/3}$ and $i\infty$, together with the geodesic segments $[i, e^{2\pi i/3}]$ and $[e^{2\pi i/3}, i\infty]$).
Notice that for $\tau \in D$ we have $|q_\tau| \le e^{-\pi\sqrt{3}} < 0.005$, which will be systematically used without
special reference.

For $a = (a_1, a_2) \in \mathbb{Q}^2$ we put $\ell_a = B_2(a_1 - \lfloor a_1\rfloor)/2$, where $B_2(T) = T^2 - T + 1/6$ is the
second Bernoulli polynomial. The quantity $\ell_a$ is $\mathbb{Z}^2$-periodic in $a$ and is thereby well-defined for
$a \in (\mathbb{Q}/\mathbb{Z})^2$ as well: for such $a$ we have $\ell_a = B_2(\tilde{a}_1)/2$, where $\tilde{a}_1$ is the lifting of the first coordinate
of $a$ to the interval $[0,1)$. Obviously, $|\ell_a| \le 1/12$; this will also be often used without special
reference.
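
The quantity $\ell_a$ depends only on the first coordinate of $a$ and is easy to tabulate with exact rational arithmetic. The following sketch is an illustration under our own naming (`bernoulli2`, `ell` are hypothetical helper names); it implements $\ell_a = B_2(a_1 - \lfloor a_1\rfloor)/2$ and can be used to check the periodicity and the bound $|\ell_a| \le 1/12$ on examples.

```python
from fractions import Fraction
from math import floor


def bernoulli2(t: Fraction) -> Fraction:
    """Second Bernoulli polynomial B_2(T) = T^2 - T + 1/6."""
    return t * t - t + Fraction(1, 6)


def ell(a1: Fraction) -> Fraction:
    """ell_a = B_2(a_1 - floor(a_1)) / 2; only the first coordinate a_1 of a matters."""
    fractional_part = a1 - floor(a1)
    return bernoulli2(fractional_part) / 2
```

For instance, `ell(Fraction(0))` equals $1/12$, where the bound is attained, and `ell(Fraction(1, 2))` equals $-1/24$.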

We fix, once and for all, an algebraic closure $\bar{\mathbb{Q}}$ of $\mathbb{Q}$, which is assumed to be a subfield of $\mathbb{C}$. In
particular, for every $a \in \mathbb{Q}$ we have the well defined root of unity $e^{2\pi i a} \in \bar{\mathbb{Q}}$. Every number field
used in this article is presumed to be contained in the fixed $\bar{\mathbb{Q}}$. If $K$ is such a number field and $v$
is a valuation on $K$, then we tacitly assume that $v$ is somehow extended to $\bar{\mathbb{Q}} = \bar{K}$; equivalently,
we fix an algebraic closure $\bar{K}_v$ and an embedding $\bar{\mathbb{Q}} \hookrightarrow \bar{K}_v$. In particular, the roots of unity $e^{2\pi i a}$
are well-defined elements of $\bar{K}_v$.

For a number field $K$ we denote by $M_K$ the set of all valuations (or places) of $K$ normalized to
extend the usual infinite and $p$-adic valuations of $\mathbb{Q}$: $|2|_v = 2$ if $v \in M_K$ is infinite, and $|p|_v = p^{-1}$
if $v$ extends the $p$-adic valuation of $\mathbb{Q}$. In the finite case we sometimes use the additive notation
$v(\cdot)$, normalized to have $v(p) = 1$. We denote by $M_K^\infty$ and $M_K^0$ the subsets of $M_K$ consisting of
the infinite (archimedean) and the finite (non-archimedean) valuations, respectively.

Recall the definition of the absolute logarithmic height $h(\cdot)$. For $\alpha \in \bar{\mathbb{Q}}$ we pick a number field $K$
containing $\alpha$ and put $h(\alpha) = [K:\mathbb{Q}]^{-1}\sum_{v \in M_K}[K_v:\mathbb{Q}_v]\log^+|\alpha|_v$, where the valuations on $K$ are
normalized to extend the standard infinite and $p$-adic valuations on $\mathbb{Q}$ and $\log^+ x = \log\max\{x, 1\}$.
The value of $h(\alpha)$ is known to be independent of the particular choice of $K$. As usual, we extend
the definition of the height to $\mathbb{P}^1(\bar{\mathbb{Q}}) = \bar{\mathbb{Q}} \cup \{\infty\}$ by setting $h(\infty) = 0$. If $\alpha$ is a non-zero rational integer or
a non-zero imaginary quadratic integer then $h(\alpha) = \log|\alpha|$.
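
For $K = \mathbb{Q}$ the definition of the height can be followed place by place. The sketch below is an illustration with hypothetical helper names (`prime_divisors`, `p_adic_valuation`, `height`); it sums $\log^+|x|_v$ over the infinite place and over the primes dividing the numerator or the denominator of a rational number $x$, all other places contributing $0$. It recovers the well-known value $h(a/b) = \log\max\{|a|,|b|\}$ for coprime integers $a$, $b$.

```python
import math
from fractions import Fraction


def prime_divisors(n: int) -> list[int]:
    """Distinct prime divisors of a non-zero integer, by trial division."""
    n = abs(n)
    primes, p = [], 2
    while p * p <= n:
        if n % p == 0:
            primes.append(p)
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        primes.append(n)
    return primes


def p_adic_valuation(x: Fraction, p: int) -> int:
    """Additive valuation v_p(x) of a non-zero rational x, normalized by v_p(p) = 1."""
    def val(n: int) -> int:
        count = 0
        while n % p == 0:
            n //= p
            count += 1
        return count
    return val(x.numerator) - val(x.denominator)


def height(x: Fraction) -> float:
    """Absolute logarithmic height of a rational number, computed over K = Q."""
    if x == 0:
        return 0.0
    total = math.log(max(abs(float(x)), 1.0))  # contribution of the infinite place
    for p in prime_divisors(x.numerator * x.denominator):
        abs_p = float(p) ** (-p_adic_valuation(x, p))  # |x|_p = p^(-v_p(x))
        total += math.log(max(abs_p, 1.0))
    return total
```

In the example `height(Fraction(3, 4))` only the prime $2$ contributes, since $|3/4|_2 = 4$ while all other absolute values are at most $1$.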

If $E$ is an elliptic curve over $\bar{\mathbb{Q}}$, define $h(E) := h(j_E)$ for $j_E$ its $j$-invariant. For $P \in X(\bar{\mathbb{Q}})$ a point with
values in $\bar{\mathbb{Q}}$ of a modular curve $X$, the height we will use (unless explicitly mentioned otherwise)
will be this naive Weil height of the elliptic curve associated to $P$, that is $h(P) = h(j(P))$.

2 Estimates for Modular Functions at Infinity 

The results of this section must be known, but we did not find them in the available literature, so 
we state and prove them here. For the sake of further applications, we have tried to give rather 
sharp constants, even if we do not actually need this precision in the sequel of the present paper. 

2.1 Estimating the j-Function 

Recall that the modular $j$-invariant $j : \mathcal{H} \to \mathbb{C}$ is defined by $j(\tau) = (12c_2(\tau))^3/\Delta(\tau)$, where

$$c_2(\tau) = \frac{(2\pi)^4}{12}\left(1 + 240\sum_{n=1}^\infty \frac{n^3q^n}{1-q^n}\right)$$

(see, for instance, [10, Section 4.2]) and $\Delta(\tau) = (2\pi)^{12}q\prod_{n=1}^\infty(1-q^n)^{24}$. Also, $j$ has the familiar
Fourier expansion $j(\tau) = q^{-1} + 744 + 196884q + \ldots$.

Proposition 2.1 For $\tau \in \mathcal{H}$ such that $|q|\,(= |q_\tau|) < 0.005$ (and, in particular, for every $\tau \in D$) we
have

$$\left|j(\tau) - q^{-1} - 744\right| \le 330000|q|. \tag{5}$$

(Recall that $D$ is the standard fundamental domain for $\mathrm{SL}_2(\mathbb{Z})$.)
Proof Using the estimate $n^3 \le 3^n$ for $n \ge 3$, we find that for $|q| < 1/3$

$$\left|\frac{12}{(2\pi)^4}c_2(\tau) - 1 - 240q\right| \le \frac{240}{1-|q|}\left(|q|^2 + 8|q|^2 + \sum_{n=3}^\infty|3q|^n\right) \le \frac{2160|q|^2}{(1-|q|)(1-3|q|)},$$

and for $|q| < 0.005$ we obtain

$$\left|\frac{12}{(2\pi)^4}c_2(\tau) - 1 - 240q\right| \le 2204|q|^2. \tag{6}$$

Further, using (4), we obtain, for $|q| < 0.005$,

$$\left|\log\frac{\Delta(\tau)}{(2\pi)^{12}q(1-q)^{24}}\right| = 24\left|\sum_{n=2}^\infty\log(1-q^n)\right| \le 24.1\sum_{n=2}^\infty|q|^n \le 24.3|q|^2.$$

Hence

$$\left|\frac{(2\pi)^{12}q}{\Delta(\tau)} - 1 - 24q\right| \le \left|(1-q)^{-24}\right|\left|\frac{(2\pi)^{12}q(1-q)^{24}}{\Delta(\tau)} - 1\right| + \left|(1-q)^{-24} - 1 - 24q\right|$$

$$\le 1.13\left|\log\frac{\Delta(\tau)}{(2\pi)^{12}q(1-q)^{24}}\right| + 314|q|^2 \le 342|q|^2.$$

Combining this with (6), we obtain (5) after a tiresome, but straightforward calculation. □

The following consequences are obtained by direct numerical computations.

Corollary 2.2 For $\tau \in D$ the following statements are true.

(i) We have $\bigl|\log|q_\tau|\bigr| \le \log(|j(\tau)| + 2400)$.

(ii) We have either $|j(\tau)| \le 3500$ or $|q_\tau| < 0.001$.

(iii) If $|j(\tau)| > 3500$ then $|j(\tau) - q_\tau^{-1}| \le 1100$ and $\frac{3}{2}|j(\tau)| \ge |q_\tau^{-1}| \ge \frac{1}{2}|j(\tau)|$.

□
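
Inequality (5) of Proposition 2.1 can be spot-checked numerically. The sketch below is a sanity check and not a proof: it evaluates $j$ from the product formula $j = E_4^3/\bigl(q\prod_{n\ge1}(1-q^n)^{24}\bigr)$ with $E_4 = 1 + 240\sum_{n\ge1} n^3q^n/(1-q^n)$, which is the definition of this section rewritten without the factors $(2\pi)^{12}$, and compares $|j - q^{-1} - 744|/|q|$ with $330000$. The truncation order `terms` and the function names are our own assumptions.

```python
import cmath
import math


def j_invariant(q: complex, terms: int = 40) -> complex:
    """j as a function of q, from the truncated series for E_4 and the product for Delta."""
    e4 = 1 + 240 * sum(n**3 * q**n / (1 - q**n) for n in range(1, terms))
    delta_product = 1
    for n in range(1, terms):
        delta_product *= (1 - q**n) ** 24
    return e4**3 / (q * delta_product)


def remainder_ratio(q: complex) -> float:
    """|j - 1/q - 744| / |q|, the quantity bounded by 330000 in (5)."""
    return abs(j_invariant(q) - 1 / q - 744) / abs(q)
```

Since the Fourier coefficients of $j$ are positive, the ratio is largest for positive real $q$; for small $|q|$ it approaches the coefficient $196884$.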



2.2 Estimating Siegel's Functions 

For a rational number $a$ we define $q^a = e^{2\pi i a\tau}$. Let $a = (a_1, a_2) \in \mathbb{Q}^2$ be such that $a \notin \mathbb{Z}^2$, and
let $g_a : \mathcal{H} \to \mathbb{C}$ be the corresponding Siegel function [9, Section 2.1]. Then we have the following
infinite product presentation for $g_a$ [9, page 29] (where $B_2(T)$ is the second Bernoulli polynomial):

$$g_a(\tau) = -q^{B_2(a_1)/2}e^{\pi i a_2(a_1-1)}\prod_{n=0}^\infty\left(1 - q^{n+a_1}e^{2\pi i a_2}\right)\left(1 - q^{n+1-a_1}e^{-2\pi i a_2}\right). \tag{7}$$

We also have [9, pages 29-30] the relations

$$g_a \circ \gamma = g_{a\gamma}\cdot(\text{a root of unity}) \quad \text{for } \gamma \in \Gamma(1), \tag{8}$$

$$g_a = g_{a'}\cdot(\text{a root of unity}) \quad \text{when } a \equiv a' \bmod \mathbb{Z}^2. \tag{9}$$


Remark that the root of unity in (8) is of order dividing 12, and in (9) of order dividing $2N$, where $N$
is the denominator of $a$ (the common denominator of $a_1$ and $a_2$; see [2] for more references).
The order of vanishing of $g_a$ at $i\infty$ (that is, the only rational number $\ell$ such that the limit
$\lim_{\tau \to i\infty} q^{-\ell}g_a(\tau)$ exists and is non-zero) is equal to the number $\ell_a$, defined in Subsection 1.1 (see [9,
page 31]).
Proposition 2.3 Let $a$ be an element of $\mathbb{Q}^2 \setminus \mathbb{Z}^2$ and $N > 1$ an integer such that $Na \in \mathbb{Z}^2$. Then
for $\tau \in D$ we have

$$\bigl|\log|g_a(\tau)| - \ell_a\log|q_\tau|\bigr| \le \log N. \tag{10}$$



Proof Due to (9), we may assume that $0 \le a_1 < 1$ and distinguish between the cases $0 < a_1 < 1$
and $a_1 = 0$. According to (7), the left-hand side of (10) is equal to

$$\left|\log|1 - q^{a_1}\zeta| + \log|1 - q^{1-a_1}\bar\zeta| + \sum_{n=1}^\infty\left(\log|1 - q^{n+a_1}\zeta| + \log|1 - q^{n+1-a_1}\bar\zeta|\right)\right|,$$

where $\zeta = e^{2\pi i a_2}$. Recall that, since $\tau \in D$, we have $|q| \le e^{-\pi\sqrt{3}}$. Hence, for $n \ge 1$ each of $|q^{n+a_1}|$
and $|q^{n+1-a_1}|$ does not exceed 0.005, whence

$$\left|\sum_{n=1}^\infty\left(\log|1 - q^{n+a_1}\zeta| + \log|1 - q^{n+1-a_1}\bar\zeta|\right)\right| \le 1.005\,\frac{|q^{1+a_1}| + |q^{2-a_1}|}{1-|q|} \le 3|q| \le 0.02. \tag{11}$$

To estimate

$$\bigl|\log|1 - q^{a_1}\zeta| + \log|1 - q^{1-a_1}\bar\zeta|\bigr|, \tag{12}$$

assume first that $0 < a_1 < 1$. Then among the numbers $|q^{a_1}|$ and $|q^{1-a_1}|$ one is bounded by $|q|^{1/N}$
and the other is bounded by $|q|^{1/2}$, which bounds (12) by

$$\bigl|\log(1 - e^{-\pi\sqrt{3}/N})\bigr| + \bigl|\log(1 - e^{-\pi\sqrt{3}/2})\bigr| \le \log N - 0.1.$$

This proves (10) in the case $0 < a_1 < 1$. When $a_1 = 0$ then $a_2 \notin \mathbb{Z}$ and $\zeta \ne 1$, which bounds (12)
by

$$\bigl|\log|1 - e^{2\pi i/N}|\bigr| + \bigl|\log(1 - e^{-\pi\sqrt{3}})\bigr| \le \log N - 0.1,$$

proving (10) in this case as well. □
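
The quantity estimated in (10) is straightforward to evaluate from the product (7), because the prefactor $-e^{\pi i a_2(a_1-1)}$ has absolute value 1. The following sketch is a numerical spot check under our own assumptions (the function name, the truncation order `terms`, and the restriction to $0 \le a_1 < 1$ are illustrative choices): it returns $\log|g_a(\tau)| - \ell_a\log|q_\tau|$ as the truncated sum of the logarithms of the factors of (7).

```python
import cmath
import math
from fractions import Fraction


def siegel_deviation(a1: Fraction, a2: Fraction, tau: complex, terms: int = 20) -> float:
    """log|g_a(tau)| - ell_a * log|q_tau| for 0 <= a1 < 1 and a not in Z^2, from (7)."""
    zeta = cmath.exp(2j * math.pi * float(a2))

    def q_power(exponent: float) -> complex:
        # q^x = exp(2 pi i x tau), as defined at the beginning of Subsection 2.2
        return cmath.exp(2j * math.pi * exponent * tau)

    total = 0.0
    for n in range(terms):
        total += math.log(abs(1 - q_power(n + float(a1)) * zeta))
        total += math.log(abs(1 - q_power(n + 1 - float(a1)) / zeta))
    return total
```

For example, with $N = 5$ one can run over all non-zero $a \in (\frac{1}{5}\mathbb{Z}/\mathbb{Z})^2$ and a few points $\tau \in D$ and compare the absolute value of the result with $\log 5$.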



Since Siegel's functions have neither poles nor zeros on the upper half-plane $\mathcal{H}$, they should be
bounded from above and from below on any compact subset of $\mathcal{H}$. In particular, they should be
bounded where $j$ is bounded. Here is a quantitative version of this.

Corollary 2.4 Let $a \in \mathbb{Q}^2 \setminus \mathbb{Z}^2$ and $N > 1$ satisfy $Na \in \mathbb{Z}^2$. Then for any $\tau \in \mathcal{H}$ we have

$$\bigl|\log|g_a(\tau)|\bigr| \le \frac{1}{12}\log(|j(\tau)| + 2400) + \log N. \tag{13}$$

Proof Replacing $\tau$ by $\gamma\tau$ and $g_a$ by $g_{a\gamma^{-1}}$ with a suitable $\gamma \in \Gamma(1)$, we may assume that $\tau \in D$;
in particular, (10) holds. Combining (10) with item (i) of Corollary 2.2 and using the inequality
$|\ell_a| \le 1/12$, we obtain (13). □



2.3 Non-Archimedean Versions

We also need non-archimedean versions of some of the above inequalities. In this subsection $K_v$ is
a field complete with respect to a non-archimedean valuation $v$ and $\bar{K}_v$ its algebraic closure. Let
$q \in K_v$ satisfy $|q|_v < 1$. For $a = (a_1, a_2) \in \mathbb{Q}^2$ such that $a \notin \mathbb{Z}^2$ define $g_a(q)$ as in (7). Recall (see
Subsection 1.1) that the roots of unity $e^{2\pi i a}$ with $a \in \mathbb{Q}$ are defined as elements of $\bar{K}_v$.

The right-hand side of (7) is, formally, not well-defined because we use rational powers of $q$.
However, if we fix $q^{1/12N^2} \in \bar{K}_v$, where $N$ is the order of $a$ in $(\mathbb{Q}/\mathbb{Z})^2$, then everything becomes
well-defined, and we again have (8) and (9). The statement of the following proposition is independent
of the particular choice of $q^{1/12N^2}$.






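Proposition 2.5 Let $a$ be an element of $\mathbb{Q}^2 \setminus \mathbb{Z}^2$ and $N > 1$ an integer such that $Na \in \mathbb{Z}^2$. Then

$$\bigl|\log|g_a(q)|_v - \ell_a\log|q|_v\bigr| \le \begin{cases} 0 & \text{if } v(N) = 0,\\ \dfrac{\log p}{p-1} & \text{if } v \mid p \mid N,\end{cases}$$

where here and below $v \mid p \mid N$ means that $v(N) > 0$ and $p$ is the prime below $v$.

Proof We again may assume that $0 \le a_1 < 1$. When $0 < a_1 < 1$ we have $\log|g_a(q)|_v = \ell_a\log|q|_v$.
When $a_1 = 0$ we have

$$\log|g_a(q)|_v = \ell_a\log|q|_v + \log|1 - e^{2\pi i a_2}|_v,$$

and $|1 - e^{2\pi i a_2}|_v = 1$ if $v(N) = 0$, while if $v \mid p \mid N$ we have $1 \ge |1 - e^{2\pi i a_2}|_v \ge p^{-1/(p-1)}$. □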

3 Locating a "Nearby Cusp" 



Let $N$ be a positive integer, $G$ a subgroup of $\mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z})$ and $X_G$ the corresponding modular
curve, defined over a number field $K$. We fix a valuation $v$ of $K$. Recall we denote by $\mathcal{O}$ the ring
of integers of $K$ and by $K_v$ the $v$-completion of $K$. When $v$ is non-archimedean, we denote by $\mathcal{O}_v$
the ring of integers of $K_v$, and by $k_v$ its residue field at $v$. As usual $\zeta_N$ will denote a primitive $N$-th
root of unity. Recall that when we say that a curve "is defined over" a field, it means that this
curve has a geometrically integral model over that field.

Let $P$ be a point on $X_G(K_v)$ such that $|j(P)|_v$ is "large". Then it is intuitively clear that, in
the $v$-adic metric, $P$ is situated "near" a cusp of $X_G$. The purpose of this section is to make this
intuitive observation precise and explicit. We shall locate this "nearby" cusp and specify what the
word "near" means.

We first recall the following description of the cuspidal locus of $X(N)$ (for more details see
e.g. [4, Chapitres V and VII]). The cusps of $X(N)$ define a closed subscheme of the smooth locus
of the modular model of $X(N)$ over $\mathbb{Z}[\zeta_N]$. Fix a uniformization $X(N)(\mathbb{C}) \simeq \bar{\mathcal{H}}/\Gamma(N)$, let $c_\infty$ be
the cusp corresponding to $i\infty \in \bar{\mathcal{H}}$, and write $q^{1/N} = e^{2i\pi\tau/N}$ for the usual parameter. If $c = \gamma(c_\infty)$,
for some $\gamma \in \mathrm{SL}_2(\mathbb{Z})$, is another cusp, denote by $q_c^{1/N} := q^{1/N} \circ \gamma^{-1}$ the parameter on $X(N)(\mathbb{C})$ at $c$. It
follows from [4, Chapitre VII, Corollaire 2.5] that the completion of the curve $X(N)$ over $\mathbb{Z}[\zeta_N]$
along the section $c$ is isomorphic to $\mathrm{Spec}(\mathbb{Z}[\zeta_N][[q_c^{1/N}]])$. In other words, the parameter $q_c^{1/N}$ at $c$
on $X(N)(\mathbb{C})$ is actually defined over $\mathbb{Z}[\zeta_N]$, that is, $q_c^{1/N}$ comes from an element of the completed
local ring $\hat{\mathcal{O}}_{X(N),c}$ of the modular model of $X(N)$ over $\mathbb{Z}[\zeta_N]$, along the section $c$.

Next we describe the local parameters at the cusps on an arbitrary $X_G$. As before, let $\Gamma$ be the
pull-back of $G \cap \mathrm{SL}_2(\mathbb{Z}/N\mathbb{Z})$ to $\Gamma(1)$. For each cusp $c$ of $X_G$ we obtain a parameter at $c$ on $X_G$
by picking a lift $\tilde{c}$ of $c$ on $X(N)$ and taking the norm $\prod_\gamma q_{\tilde{c}}^{1/N} \circ \gamma$, where $\gamma$ runs through a set of
representatives of $\Gamma/\Gamma(N)$. We denote by $t_c$ this parameter in the sequel. Note that it is defined
over a (possibly strict) subring of $\mathbb{Z}[\zeta_N]$.

It is clear from the definition that the above parameter $t_c$ defines a $v$-analytic function on a
$v$-adic neighborhood of $c \in X_G(K_v)$ which satisfies the initial condition $t_c(c) = 0$. Further, if $e_c$
is the ramification index of the covering $X_G \to X(1)$ at $c$ (note that $e_c$ divides $N$) then, setting
$q_c := t_c^{e_c}$, the familiar expansion $j = q_c^{-1} + 744 + 196884q_c + \ldots$ holds in a $v$-adic neighborhood
of $c$, the right-hand side converging $v$-adically.

Let us define explicitly a set $\Omega_c = \Omega_{c,v}$ on which $t_c$ and $q_c$ are defined and analytic. Assume
first that $v$ is archimedean. Let $\tilde{D}$ be the fundamental domain $D$ modified as follows:

$$\tilde{D} = D \cup \{i\infty\} \setminus (\text{the arc connecting } i \text{ and } e^{2\pi i/3}). \tag{14}$$

Then the set $\Delta = \tilde{D} + \mathbb{Z}$ of translates of $\tilde{D}$ by the rational integers has the following properties.

(i) If for some $\tau \in \Delta$ and $\gamma \in \Gamma(1)$ we have $\gamma(\tau) \in \Delta$ then $\tau \equiv \gamma(\tau) \bmod 1$.

(ii) For any $\gamma \in \Gamma(1)$ either $\gamma(\Delta) = \Delta$ or $\gamma(\Delta) \cap \Delta = \emptyset$.

(iii) The quotient $\Delta/\Gamma$ is an open subset of $\bar{\mathcal{H}}/\Gamma = X_G(\mathbb{C})$. (Recall that $\Gamma$ is the pull-back of
$G \cap \mathrm{SL}_2(\mathbb{Z}/N\mathbb{Z})$ to $\mathrm{SL}_2(\mathbb{Z})$.)

The same properties hold for the set $\gamma(\Delta)$ for any $\gamma \in \Gamma(1)$. Now we define $\Omega_c = \gamma(\Delta)/\Gamma$,
where $\gamma \in \Gamma(1)$ is chosen so that $\gamma(i\infty)$ represents the cusp $c$. It follows from the properties above
that the sets $\Omega_c$ are pairwise disjoint, and that $q_c$ and $t_c$ are defined and analytic on $\Omega_c$.

If $v$ is non-archimedean, then we define $\Omega_c = \Omega_{c,v}$ as the set of the points from $X_G(K_v)$ having
reduction $c$ at $v$.



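Proposition 3.1 Put

$$X_G(K_v)^+ = \begin{cases}\{P \in X_G(K_v) : |j(P)|_v > 3500\} & \text{if } v \in M_K^\infty,\\ \{P \in X_G(K_v) : |j(P)|_v > 1\} & \text{if } v \in M_K^0.\end{cases}$$

Then

$$X_G(K_v)^+ \subset \bigcup_{c \in \mathcal{C}}\Omega_{c,v} \tag{15}$$

with equality for the non-archimedean $v$. Also, for $P \in \Omega_{c,v}$ we have

$$\left|j(P) - q_c(P)^{-1}\right|_v \le 1100 \tag{16}$$

if $v$ is archimedean, and $|j(P)|_v = |q_c(P)^{-1}|_v$ if $v$ is non-archimedean.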




Proof For the non-archimedean $v$ both statements are immediate. For archimedean $v$ they
follow from Corollary 2.2. Indeed, fix a uniformization $X_G(K_v) = \bar{\mathcal{H}}/\Gamma$ and let $\tau_0 \in \bar{\mathcal{H}}$ be a lift of
$P \in X_G(K_v)^+$. Pick $\gamma \in \Gamma(1)$ such that $\tau = \gamma(\tau_0) \in D$. Then $|j(\tau)| = |j(P)| > 3500$, and item (ii)
of Corollary 2.2 implies that $|q_\tau| < 0.001 < e^{-2\pi}$. It follows that $\tau \in \tilde{D}$, which is equivalent to
saying that $P \in \Omega_c$ with $c = \gamma^{-1}(c_\infty)$. This proves (15), and item (iii) of Corollary 2.2 implies (16)
and (17). □

The proposition implies that for every $P \in X_G(K_v)^+$ there exists a cusp $c$ such that $P \in \Omega_{c,v}$.
We call it a $v$-nearby cusp, or simply a nearby cusp to $P$.

Remark 3.2 As we have already seen, the sets $\Omega_c$ are pairwise disjoint when $v$ is archimedean.
The same is true if $v$ is non-archimedean and $v(N) = 0$, as in this case the cusps define a finite
étale scheme over $\mathcal{O}_v$. Thus, in these cases the nearby cusp is well-defined.

However, if $v(N) > 0$ then the sets $\Omega_c$ are not disjoint, see the addendum below; in particular,
in this case a point in $X_G(K_v)$ may have several nearby cusps. This is absolutely harmless for
our arguments, but for further applications it would be nice to refine the sets $\Omega_c$ to make them
pairwise disjoint; in particular, this would allow us to define "the $v$-nearest cusp" rather than just
"a $v$-nearby cusp" for any $v$. In the addendum below we examine more carefully the geometry of
cusps and their neighborhoods in the non-archimedean case. In particular, we explicitly exhibit a
pairwise disjoint system of $v$-adic cusp neighborhoods.

Addendum: more on the cusps and their $v$-adic neighborhoods¹

Recall first of all that the modular interpretation of $X(N)$ associates with each cusp a Néron
polygon $C$ with $N$ sides on $\mathbb{Z}[\zeta_N]$, endowed with its structure of generalized elliptic curve, and
enhanced with a basis of $C[N] \simeq \mathbb{Z}/N\mathbb{Z} \times \mu_N = \langle q^{1/N}, \zeta_N\rangle$ such that the determinant of this
basis is 1, and two bases are identified if they are conjugate by the subgroup $\pm U = \pm\begin{pmatrix}1 & *\\ 0 & 1\end{pmatrix}$ of
$\mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z})$, the action being $\begin{pmatrix}\varepsilon & a\\ 0 & \varepsilon\end{pmatrix} : (q^{1/N}, \zeta_N) \mapsto (q^{\varepsilon/N}\zeta_N^a, \zeta_N^\varepsilon)$, for $\varepsilon = \pm1$ and $a \in \mathbb{Z}/N\mathbb{Z}$. We
may, for instance, interpret $c_\infty$ as the orbit $\{(C, (q^{\varepsilon/N}\zeta_N^a, \zeta_N^\varepsilon)) : \varepsilon \in \{\pm1\},\ a \in \mathbb{Z}/N\mathbb{Z}\}$ of enhanced
Néron polygons over $\mathbb{Z}[\zeta_N]$.

¹ The material of this addendum will not be used in the present article, but we include it for future references.

It follows that the modular interpretation of $X_G$ associates to each cusp an orbit of our
enhanced Néron polygon $(C, (q^{1/N}, \zeta_N))$ under the action of the group generated by $G$ and $\pm U$. We
see from the above that the cusps of $X_G$ have values in a subring of $\mathbb{Z}[\zeta_N]$. Assume moreover
that $X_G$ is defined over $K$, of which $v$ is a place of characteristic $p$, with $N = p^nN'$ and $p \nmid N'$.
Extending $v$ to a place of $\mathcal{O}[\zeta_N]$ if necessary, and setting $\mathcal{O}'_v := (\mathcal{O}[\zeta_{N'}])_v$, one sees that the closed
subscheme of cusps over $\mathcal{O}'_v$ may be written as a sum of connected components of shape $\mathrm{Spec}(R)$
where $R$ is a subring of $\mathcal{O}'_v[\zeta_{p^n}]$. Therefore if $v(N) = 0$, the subscheme of cusps is étale over $\mathcal{O}_v$,
but this may not be the case if $v(N) > 0$. In the latter case, however, the ramification is well
controlled. Indeed, with the preceding notations, set $\pi := (1 - \zeta_{p^n})$. Any two different $p^n$-th roots
of unity $\zeta'_{p^n}$ and $\zeta''_{p^n}$ satisfy $(\zeta'_{p^n} - \zeta''_{p^n}) = \pi^{p^k}\alpha$ with $\alpha$ a $v$-invertible element and $0 \le k \le n-1$.
As $v(\pi) = 1/p^{n-1}(p-1)$ (normalizing $v$ so that $v(p) = 1$), it follows that the Néron polygons
enhanced with a level-$N$ structure are distinct over $\mathbb{Z}[\zeta_N]/(\pi^{p^{[\ldots]}})$. The modular interpretation
shows more precisely that if two different cusps $c_1$ and $c_2$ have the same reduction at $v$, then $t_{c_1}(c_2)$
has $v$-adic valuation less than or equal to $1/(p-1)$ (where $t_c$ is the parameter defined at the beginning
of this Section). This remark will be used later on.

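To illustrate all this with a familiar example, letting $G := \begin{pmatrix}1 & *\\ 0 & *\end{pmatrix} \subset \mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z})$, which gives
rise to the modular curve $X_1(N)$, one finds that there are $|(\mathbb{Z}/N\mathbb{Z})^\times|$ cusps, with modular
interpretation corresponding to $\{(C, \zeta_N^{\varepsilon a}) : \varepsilon \in \{\pm1\}\}$ where $a$ runs through $(\mathbb{Z}/N\mathbb{Z})^\times/\pm1$, and
$\{(C, q^{\varepsilon a/N}\zeta_N^\alpha) : \varepsilon \in \{\pm1\},\ \alpha \in (\mathbb{Z}/N\mathbb{Z})\}$, where $a$ runs through the same set. The curve $X_1(N)$
is defined over $\mathbb{Q}$ and has a modular model over $\mathbb{Z}$. The cusps in the former subset above have
values in $\mathbb{Z}[\zeta_N + \zeta_N^{-1}]$, and the cusps in the latter subset have values in $\mathbb{Z}$. In other words, the
closed subscheme of cusps over $\mathbb{Z}$ is isomorphic to the disjoint union of $\mathrm{Spec}(\mathbb{Z}[\zeta_N + \zeta_N^{-1}])$ and
$|(\mathbb{Z}/N\mathbb{Z})^\times|/2$ copies of $\mathrm{Spec}(\mathbb{Z})$.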

Let us examine the $v$-adic neighborhoods of the cusps. If $v(N) = 0$ then the cusps of $X_G$ define
a finite étale closed subscheme of $X_G$ over $\mathcal{O}_v$, so the sets $\Omega_c$ are obviously pairwise disjoint.

Now assume that $v(N) > 0$. Let $p$ be the residue characteristic of $v$ and $p^n \,\|\, N$ be the largest
power of $p$ dividing $N = p^nN'$. The scheme of cusps on $X_G$ may be no longer étale over $\mathcal{O}_v$.
We can however still partition it into connected components, which totally ramify in the fiber
at $v$. More precisely, setting as above $\mathcal{O}'_v := (\mathcal{O}[\zeta_{N'}])_v$, each connected component over $\mathcal{O}'_v$ is
schematically a $\mathrm{Spec}(R)$ for $R$ a subring of $\mathcal{O}'_v[\zeta_{p^n}]$. Each set $\Omega_c$ as in Proposition 3.1 contains
exactly one such connected component of cusps, so when $R$ does ramify nontrivially at $v$, then $\Omega_c$
is clearly "too large" (one has $\Omega_{c_1} = \Omega_{c_2}$ exactly when $c_1$ and $c_2$ have the same reduction at $v$). We
want to show that, nevertheless, the refined sets $\Omega_c^+$, defined by

$$\Omega_c^+ = \Omega_{c,v}^+ = \{P \in \Omega_c : |q_c(P)|_v < p^{-N/(p-1)}\},$$

are pairwise disjoint.

If the cusps $c_1$ and $c_2$ belong to distinct connected components, then already $\Omega_{c_1}$ and $\Omega_{c_2}$
are disjoint, so $\Omega_{c_1}^+$ and $\Omega_{c_2}^+$ are disjoint a fortiori. Now assume that $c_1$ and $c_2$ belong to the
same component, i.e., have the same reduction at $v$. In this case, as explained above, one may write
$t_{c_1}(c_2) = \pi^{p^k}\alpha \in \mathcal{O}'_v[\zeta_{p^n}]$, for $\pi$ a certain uniformizer (e.g. $\pi := (\zeta_{p^n} - 1)$), where the element $\alpha$
is $v$-invertible and $0 \le k \le n-1$. As $\Omega_{c_1}^+$ is contained in $\{P \in X_G(K_v) : |t_{c_1}(P)|_v < p^{-1/(p-1)}\}$
(recall $q_c = t_c^{e_c}$, with $e_c \mid N$), we see that $c_2$ does not belong to $\Omega_{c_1}^+$, which implies that the sets $\Omega_{c_1}^+$
and $\Omega_{c_2}^+$ are disjoint.

4 Modular Units 

In this section we recall the construction of modular units on the modular curve $X_G$. By a modular
unit we mean a rational function on $X_G$ having poles and zeros only at the cusps.
4.1 Integrality of Siegel's Function

For $a \in \mathbb{Q}^2 \setminus \mathbb{Z}^2$ Siegel's function $g_a$ (see Subsection 2.2) is algebraic over the field $\mathbb{C}(j)$: this
follows from the fact that $g_a^{12}$ is automorphic of certain level [9, page 29]. Since $g_a$ is holomorphic
and does not vanish on the upper half-plane $\mathcal{H}$, both $g_a$ and $g_a^{-1}$ must be integral over the ring
$\mathbb{C}[j]$. Actually, a stronger assertion holds.

Proposition 4.1 Both $g_a$ and $(1 - \zeta_N)g_a^{-1}$ are integral over $\mathbb{Z}[j]$. Here $N$ is the exact order of $a$
in $(\mathbb{Q}/\mathbb{Z})^2$ and $\zeta_N$ is a primitive $N$-th root of unity.

This is, essentially, established in [9], but is not stated explicitly therein. For a complete proof,
see [2, Proposition 2.2].

4.2 Modular Units on X(N) 

From now on, we fix an integer $N > 1$. Recall that the curve $X(N)$ is defined over the field
$\mathbb{Q}(\zeta_N)$. Moreover, the field $\mathbb{Q}(X(N)) = \mathbb{Q}(\zeta_N)(X(N))$ is a Galois extension of $\mathbb{Q}(j)$, the Galois
group being isomorphic to $\mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z})/\pm1$. The isomorphism

$$\mathrm{Gal}(\mathbb{Q}(X(N))/\mathbb{Q}(j)) \cong \mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z})/\pm1 \tag{18}$$

is defined up to an inner automorphism; once it is fixed, we have the well-defined isomorphisms

$$\mathrm{Gal}(\mathbb{Q}(X(N))/\mathbb{Q}(\zeta_N, j)) \cong \mathrm{SL}_2(\mathbb{Z}/N\mathbb{Z})/\pm1, \qquad \mathrm{Gal}(\mathbb{Q}(\zeta_N)/\mathbb{Q}) \cong (\mathbb{Z}/N\mathbb{Z})^\times \tag{19}$$

(which give the geometric and arithmetic parts of the Galois group respectively). We may identify
the groups on the left and on the right in (18) and (19). Our choice of the isomorphism (18) will
be specified in Proposition 4.2.

According to Theorem 1.2 from [9, Section 2.1], given $a = (a_1, a_2) \in (N^{-1}\mathbb{Z})^2 \setminus \mathbb{Z}^2$, the
function $g_a^{12N}$ is $\Gamma(N)$-automorphic of weight 0. Hence $g_a^{12N}$ defines a rational function on the modular
curve $X(N)$, to be denoted by $u_a$ (one checks that $u_a = u_{-a}$). Since the root of unity in (9) is of
order dividing $12N$, we have $u_a = u_{a'}$ when $a \equiv a' \bmod \mathbb{Z}^2$. Hence $u_a$ is well-defined when $a$ is a
non-zero element of the abelian group $(N^{-1}\mathbb{Z}/\mathbb{Z})^2$, which will be assumed in the sequel. We put
$\mathbf{A} = (N^{-1}\mathbb{Z}/\mathbb{Z})^2 \setminus \{0\}$.

The functions u a have the following properties. 

Proposition 4.2 (i) The functions $u_a$ and $(1 - \zeta_{N_a})^{12N}u_a^{-1}$ are integral over $\mathbb{Z}[j]$, where $N_a$ is
the exact order of $a$ in $(N^{-1}\mathbb{Z}/\mathbb{Z})^2$. In particular, $u_a$ has zeros and poles only at the cusps
of $X(N)$.

(ii) The functions $u_a$ belong to the field $\mathbb{Q}(X(N))$, and the Galois action on the set $\{u_a\}$ is
compatible with the (right) linear action of $\mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z})$ on $\mathbf{A}$ in the following sense: the
isomorphism (18) can be chosen so that for any $\sigma \in \mathrm{Gal}(\mathbb{Q}(X(N))/\mathbb{Q}(j)) = \mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z})/\pm1$
and any $a \in \mathbf{A}$ we have $u_a^\sigma = u_{a\sigma}$.

(iii) For the cusp $c_\infty$ at infinity we have $\mathrm{ord}_{c_\infty}u_a = 12N^2\ell_a$, where $\ell_a$ is defined in Subsection 1.1.
For an arbitrary cusp $c$ we have $|\mathrm{ord}_c u_a| \le N^2$.

Proof Item (i) follows from Proposition 4.1. Item (ii) is Proposition 1.3 from [9, Chapter 2].
We are left with item (iii). The $q$-order of vanishing of $u_a$ at $i\infty$ is $12N\ell_a$. Since the ramification
index of the morphism $X(N) \to X(1)$ at every cusp is $N$, we obtain $\mathrm{ord}_{c_\infty}u_a = 12N^2\ell_a$. Since
$|\ell_a| \le 1/12$, we have $|\mathrm{ord}_{c_\infty}u_a| \le N^2$. The case of arbitrary $c$ reduces to the case $c = c_\infty$ upon
replacing $a$ by $a\sigma$ where $\sigma \in \mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z})$ is such that $\sigma(c) = c_\infty$. □
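
The order $12N^2\ell_a$ at $c_\infty$ is an integer that can be written down explicitly: for $a_1 = k/N$ with $0 \le k < N$ one finds $12N^2\ell_a = 6k^2 - 6kN + N^2$. The sketch below is an illustration with a hypothetical function name; it computes the order with exact rationals from $\ell_a = B_2(a_1)/2$, $B_2(T) = T^2 - T + 1/6$, and allows one to confirm this closed form and the bound $|\mathrm{ord}_{c_\infty}u_a| \le N^2$ on examples.

```python
from fractions import Fraction


def order_at_infinity(k: int, N: int) -> int:
    """12 N^2 ell_a for a whose first coordinate is k/N modulo 1."""
    a1 = Fraction(k % N, N)                         # lift of the first coordinate to [0, 1)
    ell = (a1 * a1 - a1 + Fraction(1, 6)) / 2       # ell_a = B_2(a1) / 2
    value = 12 * N * N * ell
    assert value.denominator == 1                   # the order of vanishing is an integer
    return int(value)
```

The maximum $N^2$ is reached for $k = 0$, and the smallest values are close to $-N^2/2$, for $k$ near $N/2$.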

The group generated by the principal divisors $(u_a)$, where $a \in \mathbf{A}$, is contained in the group of
cuspidal divisors on $X(N)$ (that is, the divisors supported at the set $\mathcal{C}(N) = \mathcal{C}(\Gamma(N))$ of cusps).
Since principal divisors are of degree 0, the rank of the former group is at most $|\mathcal{C}(N)| - 1$. It is
fundamental for us that this rank is indeed maximal. The following proposition is Theorem 3.1 in
[9, Chapter 2].

Proposition 4.3 The group generated by the set $\{(u_a) : a \in \mathbf{A}\}$ is of rank $|\mathcal{C}(N)| - 1$. □

We also need to know the behavior of the functions $u_a$ near the cusps, and estimate them in
terms of the modular invariant $j$. In the following proposition $K$ is a number field containing $\zeta_N$
and $v$ is a valuation of $K$, extended somehow to $\bar{K}$. For $v \in M_K$ we define

$$\rho_v = \begin{cases} 12N\log N & \text{if } v \in M_K^\infty,\\ 0 & \text{if } v \in M_K^0 \text{ and } v(N) = 0,\\ 12N\dfrac{\log p}{p-1} & \text{if } v \in M_K^0 \text{ and } v \mid p \mid N.\end{cases} \tag{20}$$

We use the notation of Section 3.

Proposition 4.4 Let $c$ be a cusp of $X(N)$ and $v$ a place of $K$. For $P \in \Omega_{c,v}$ we have

$$\bigl|\log|u_a(P)|_v - \mathrm{ord}_c u_a\log|t_c(P)|_v\bigr| \le \rho_v.$$

For $v \in M_K^\infty$ and $P \in X(N)(K_v)$ we have

$$\bigl|\log|u_a(P)|_v\bigr| \le N\log(|j(P)|_v + 2400) + \rho_v.$$

Proof The first statement for $c = c_\infty$ is an immediate consequence of Propositions 2.3 and 2.5
(notice that $\log|q_c|_v = N\log|t_c|_v$ for every cusp $c$). The general case reduces to the case $c = c_\infty$
by applying a suitable Galois automorphism. The second statement follows from Corollary 2.4. □



4.3 K-Rational Modular Units on $X_G$

Now let $K$ be a number field, and let $G$ be a subgroup of $\mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z})$ containing $-1$. Let $\det G$ be
the image of $G$ under the determinant map $\det : \mathrm{GL}_2(\mathbb{Z}/N\mathbb{Z}) \to (\mathbb{Z}/N\mathbb{Z})^\times = \mathrm{Gal}(\mathbb{Q}(\zeta_N)/\mathbb{Q})$ (recall
that we have a well-defined isomorphism (19)). In the sequel we shall assume that $K \supseteq \mathbb{Q}(\zeta_N)^{\det G}$,
where $\mathbb{Q}(\zeta_N)^{\det G}$ is the subfield of $\mathbb{Q}(\zeta_N)$ stable under $\det G$. This assumption implies that the
curve $X_G$ is defined over $K$. Then $G' := \mathrm{Gal}(K(X(N))/K(X_G))$ is a subgroup of $G/\pm1$, which
contains the geometric part $(G \cap \mathrm{SL}_2(\mathbb{Z}/N\mathbb{Z}))/\pm1$. For every $a \in \mathbf{A}$ we put $w_a = \prod_{\sigma \in G'}u_{a\sigma}$.
Since $u_{a\sigma} = u_a^\sigma$, the functions $w_a$ are contained in $K(X_G)$. They have the following properties.

Proposition 4.5 (i) The functions $w_a$ have zeros and poles only at the cusps of $X_G$. If $c$ is
such a cusp, then $|\mathrm{ord}_c w_a| \le |G'|N^2$.

(ii) For every $a \in \mathbf{A}$ there exists an algebraic integer $\lambda_a \in \mathbb{Z}[\zeta_N]$, which is a product of $|G'|$
factors of the form $(1 - \zeta_{N'})^{12N}$, where $N' \mid N$, such that the functions $w_a$ and $\lambda_a w_a^{-1}$ are
integral over $\mathbb{Z}[j]$.

(iii) Let $c$ be a cusp of $X_G$. Then for $v \in M_K$ and $P \in \Omega_{c,v}$ we have

$$\bigl|\log|w_a(P)|_v - \mathrm{ord}_c w_a\log|t_c(P)|_v\bigr| \le |G'|\rho_v.$$

(iv) For $v \in M_K^\infty$ and $P \in X_G(K_v)$ we have

$$\bigl|\log|w_a(P)|_v\bigr| \le |G'|N\log(|j(P)|_v + 2400) + |G'|\rho_v.$$

(v) The group generated by the principal divisors $(w_a)$ is of rank $|\mathcal{C}(G,K)| - 1$.
Proof Items (i) and (ii) follow from Proposition 4.2, items (iii) and (iv) follow from
Proposition 4.4. Finally, item (v) follows from Proposition 4.3. □



4.4 A Unit Vanishing at the Given Cusps 

Item (v) of Proposition 4.5 implies that for any proper subset $\Sigma$ of $\mathcal{C}(G,K)$ there is a $K$-rational
unit on $X_G$ vanishing at this subset; moreover, such a unit can be expressed as a multiplicative
combination of the units $w_a$. We call it a Runge unit for $\Sigma$. In this subsection we give a quantitative
version of this fact. We shall use the following simple lemma, where we denote by $\|\cdot\|_1$ the $\ell_1$-norm.

Lemma 4.6 Let $M$ be an $s \times t$ matrix of rank $s$ with entries in $\mathbb{Z}$. Assume that the entries of $M$
do not exceed $A$ in absolute value. Then there exists a vector $b \in \mathbb{Z}^t$ such that $\|b\|_1 \le s^{s/2+1}A^{s-1}$,
and such that all the $s$ coordinates of the vector $Mb$ (in the standard basis) are strictly positive.

Proof Assume first that $s = t$. Let $d$ be the determinant of $M$. Then the column vector $(|d|, \ldots, |d|)$ can be written as $Mb$, where $b = (b_1, \ldots, b_s)$ with $b_k$ being (up to the sign) the determinant of the matrix obtained from $M$ upon replacing the $k$-th column by $(1, \ldots, 1)$. Using Hadamard's inequality, we bound $|b_k|$ by $s^{s/2}A^{s-1}$. This proves the lemma in the case $s = t$. The general case reduces to the case $s = t$ by selecting a non-singular $s \times s$ sub-matrix, which gives $s$ entries of the vector $b$; the remaining $t - s$ entries are set to be $0$. □
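
The construction in this proof is effective. The following Python sketch is an illustration only and is not part of the original argument; the function names `int_det`, `positive_vector` and `l1_bound` are ours. It selects a non-singular $s \times s$ sub-matrix by brute force over column subsets (fine for small examples, not meant to be efficient) and builds $b$ by Cramer's rule, so that $Mb = (|d|, \ldots, |d|)$.

```python
from itertools import combinations


def int_det(rows):
    """Determinant of a square integer matrix (fraction-free Bareiss elimination)."""
    a = [list(r) for r in rows]
    n = len(a)
    sign, prev = 1, 1
    for k in range(n - 1):
        if a[k][k] == 0:
            # Look for a row below with a non-zero entry in column k and swap it up.
            swap = next((i for i in range(k + 1, n) if a[i][k] != 0), None)
            if swap is None:
                return 0
            a[k], a[swap] = a[swap], a[k]
            sign = -sign
        for i in range(k + 1, n):
            for j in range(k + 1, n):
                # Bareiss update; the division by the previous pivot is exact.
                a[i][j] = (a[i][j] * a[k][k] - a[i][k] * a[k][j]) // prev
        prev = a[k][k]
    return sign * a[n - 1][n - 1]


def positive_vector(M):
    """Return (b, d) with M*b = (d, ..., d) and d > 0, as in the proof of the lemma."""
    s, t = len(M), len(M[0])
    for cols in combinations(range(t), s):
        sub = [[row[c] for c in cols] for row in M]
        d = int_det(sub)
        if d != 0:
            break
    else:
        raise ValueError("M does not have rank s")
    sgn = 1 if d > 0 else -1
    b = [0] * t  # entries outside the chosen columns stay 0
    for k, c in enumerate(cols):
        # Replace the k-th column of the sub-matrix by (1, ..., 1).
        replaced = [row[:k] + [1] + row[k + 1:] for row in sub]
        b[c] = sgn * int_det(replaced)
    return b, abs(d)


def l1_bound(s, A):
    """Right-hand side s^(s/2+1) * A^(s-1) of the lemma."""
    return s ** (s / 2 + 1) * A ** (s - 1)
```

For instance, `positive_vector([[1, 2], [3, 4]])` returns `([-2, 2], 2)`, and indeed $M b = (2, 2)$ with $\|b\|_1 = 4 \le 2^2 \cdot 4$.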

Now let $G$, $K$ and $G'$ be as in Subsection 4.3.

Proposition 4.7 Let $\Sigma$ be a proper subset of $C(G,K)$ and $s$ a positive integer satisfying $s \ge |\Sigma|$. Then one can associate to every $a \in A$ an integer $b_a$ such that

$$B := \sum_{a \in A} |b_a| \le s^{s/2+1}\bigl(|G'|N^2\bigr)^{s-1} \tag{21}$$

and the unit $w := \prod_{a \in A} w_a^{b_a}$ has the following properties.

(i) If $c$ is a cusp such that the orbit of $c$ is in $\Sigma$ then $\mathrm{ord}_c w > 0$ (that is, $w$ is a Runge unit for $\Sigma$).

(ii) There exists an algebraic integer $\lambda$, which is a product of at most $|G'|B$ factors of the form $(1 - \zeta_{N'})^{12N}$, where $N' \mid N$, such that $\lambda w$ is integral over $\mathbb{Z}[j]$.

(iii) Let $c$ be a cusp of $X_G$. Then for $v \in M_K$ and $P \in \Omega_{c,v}$ we have

$$\bigl|\log|w(P)|_v - \mathrm{ord}_c w \log|t_c(P)|_v\bigr| \le B|G'|\rho_v.$$

(iv) For $v \in M_K^\infty$ and $P \in X_G(\bar K_v)$ we have

$$\bigl|\log|w(P)|_v\bigr| \le B|G'|N\log(|j(P)|_v + 2400) + B|G'|\rho_v.$$

Proof The $K$-rational Galois orbit of a cusp $c$ has $[K(c) : K]$ elements. Fix a representative in every such orbit and consider the $|C(G,K)| \times |A|$ matrix $(\mathrm{ord}_c w_a)$, where $c$ runs over the set of selected representatives. According to item (v) of Proposition 4.5 this matrix is of rank $|C(G,K)| - 1$, and the only (up to proportionality) linear relation between the rows is $\sum_c [K(c) : K]\,\mathrm{ord}_c w_a = 0$ for every $a \in A$. It follows that any proper subset of the rows of our matrix is linearly independent. In particular, if we select the rows corresponding to the set $\Sigma$, we get a sub-matrix of rank $|\Sigma|$. Applying to it Lemma 4.6, where we may take $A = |G'|N^2$ due to item (i) of Proposition 4.5, we find integers $b_a$ such that (21) holds and the function $w = \prod_{a \in A} w_a^{b_a}$ is as desired, by Proposition 4.5 (ii)-(iv). □

5 Proof of Theorem 1.2



We use the notations of Section 3. We also write $d_v = [K_v : \mathbb{Q}_v]$ and $d = [K : \mathbb{Q}]$. We fix an extension of every $v \in M_K$ to $\bar K$ and denote this extension by $v$ as well. We shall use the following obvious estimates for the quantities $\rho_v$ defined in (20):

$$\sum_{v \in M_K^\infty} d_v\rho_v = 12dN\log N, \qquad \sum_{v \in M_K^0} d_v\rho_v = 12dN\sum_{p \mid N}\frac{\log p}{p-1} \le 12dN\log N. \tag{22}$$
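
As a numerical sanity check of the inequality in (22), the following Python sketch (ours, not part of the paper; the helper names are hypothetical) compares the two totals after division by $d$, using $\sum_{p \mid N} \frac{\log p}{p-1} \le \sum_{p \mid N} \log p \le \log N$.

```python
import math


def prime_divisors(n):
    """Distinct prime divisors of n >= 1, by trial division."""
    primes, p = [], 2
    while p * p <= n:
        if n % p == 0:
            primes.append(p)
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        primes.append(n)
    return primes


def archimedean_total(N):
    """d^{-1} * (sum over archimedean v of d_v * rho_v), that is 12 N log N."""
    return 12 * N * math.log(N)


def finite_total(N):
    """d^{-1} * (sum over finite v of d_v * rho_v), that is 12 N sum_{p|N} log(p)/(p-1)."""
    return 12 * N * sum(math.log(p) / (p - 1) for p in prime_divisors(N))
```

Equality holds for $N = 2$; for every other level the finite total is strictly smaller.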

5.1 A Runge Unit 

Fix $P \in Y_G(\mathcal{O}_S)$. Let $S_1$ consist of the places $v \in M_K$ such that $P \in X_G(\bar K_v)^+$. Plainly, $S_1 \subset S$. For $v \in S_1$ let $c_v$ be a $v$-nearby cusp to $P$ (if there are several, choose any of them) and let $\Sigma$ be the set of all $\mathrm{Gal}(\bar K/K)$-orbits of cusps containing some of the $c_v$. Then $|\Sigma| \le |S_1| \le |S|$, and since $|S| < |C(G,K)|$ by assumption, $\Sigma$ is a proper subset of $C(G,K)$. It follows from Proposition 4.7 that there exists a $K$-rational modular unit $w = \prod_{a \in A} w_a^{b_a}$ such that $\mathrm{ord}_{c_v} w > 0$ for every $v \in S_1$ (a Runge unit) for which $B := \sum_{a \in A} |b_a|$ satisfies (21), where we may put $s = |S|$.

Since $w$ is a modular unit and $P$ is not a cusp, we have $w(P) \ne 0, \infty$, and the product formula gives $\sum_{v \in M_K} d_v\log|w(P)|_v = 0$. We want to show that this is impossible when $h(P)$ is too large.

5.2 Partitioning the Places of K 

We partition the set of places $M_K$ into three pairwise disjoint subsets: $M_K = S_1 \cup S_2 \cup S_3$, where $S_i \cap S_j = \emptyset$ for $i \ne j$. The set $S_1$ is already defined. The set $S_2$ consists of the archimedean places not belonging to $S_1$. Obviously, $S_1 \cup S_2 \subset S$. Finally, the set $S_3$ consists of the places not belonging to $S_1 \cup S_2$; in other words, $v \in S_3$ if and only if $v$ is non-archimedean and $|j(P)|_v \le 1$. We will estimate from above the quantities

$$\Xi_i = d^{-1}\sum_{v \in S_i} d_v\log|w(P)|_v \qquad (i = 1, 2, 3).$$

We will show that $\Xi_1 \le -N^{-1}h(P) + O(1)$, where the $O(1)$-term is independent of $P$ (it will be made explicit). Further, we will bound $\Xi_2$ and $\Xi_3$ independently of $P$. Since

$$\Xi_1 + \Xi_2 + \Xi_3 = 0, \tag{23}$$

this would bound $h(P)$.
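
To make the partition concrete, here is a toy Python sketch (ours, not from the paper) for the simplest situation $K = \mathbb{Q}$, where $d = d_v = 1$ and a point is represented only by a non-zero rational value of $j$. The function name `partition_places` and the label `"inf"` for the archimedean place are our own conventions. It lists $S_1$ and $S_2$ (the set $S_3$ consists of all remaining primes) and computes the height as the sum of the local terms $\log^+|j|_v$.

```python
from fractions import Fraction
import math


def prime_power_factors(n):
    """Map each prime p dividing n >= 1 to the exponent of p in n."""
    factors, p = {}, 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def partition_places(j):
    """For a non-zero rational j, return (S1, S2, height) over K = Q.

    S1: places v with |j|_v > 3500 (archimedean) or |j|_v > 1 (finite).
    S2: the archimedean place, if it is not in S1.
    height: sum over all places of log^+ |j|_v.
    """
    den = prime_power_factors(j.denominator)
    S1 = sorted(den)  # |j|_p > 1 exactly for the primes dividing the denominator
    S2 = []
    if abs(j) > 3500:
        S1.append("inf")
    else:
        S2.append("inf")
    height = max(0.0, math.log(abs(j))) + sum(e * math.log(p) for p, e in den.items())
    return S1, S2, height
```

The computed height agrees with the usual formula $\log\max(|a|, |b|)$ for $j = a/b$ in lowest terms.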

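5.3 Estimating $\Xi_1$

For $v \in S_1$ we have $P \in \Omega_{c_v,v}$, so we may apply item (iii) of Proposition 4.7. Since $\mathrm{ord}_{c_v} w \ge 1$ and $\log|q_{c_v}(P)|_v = e_{c_v}\log|t_{c_v}(P)|_v$ with $e_{c_v} \mid N$, we have

$$\Xi_1 \le d^{-1}\sum_{v \in S_1} d_v\frac{\mathrm{ord}_{c_v} w}{e_{c_v}}\log|q_{c_v}(P)|_v + B|G'|d^{-1}\sum_{v \in M_K} d_v\rho_v \tag{24}$$

$$\le N^{-1}d^{-1}\sum_{v \in S_1} d_v\log|q_{c_v}(P)|_v + 24B|G'|N\log N \tag{25}$$

$$\le -N^{-1}d^{-1}\sum_{v \in S_1} d_v\log|j(P)|_v + N^{-1}\log 2 + 24B|G'|N\log N, \tag{26}$$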

where we use (22) and (17). Further, for $v \in M_K \setminus S_1$ we have $|j(P)|_v \le 3500$ if $v$ is archimedean, and $|j(P)|_v \le 1$ if $v$ is non-archimedean. It follows that

$$d^{-1}\sum_{v \in M_K \setminus S_1} d_v\log^+|j(P)|_v \le \log 3500.$$

Since $|j(P)|_v > 1$ for $v \in S_1$, one may also replace $\log|j(P)|_v$ by $\log^+|j(P)|_v$ in (26). Hence

$$\Xi_1 \le -N^{-1}d^{-1}\sum_{v \in M_K} d_v\log^+|j(P)|_v + N^{-1}\log 7000 + 24B|G'|N\log N$$

$$= -N^{-1}h(P) + N^{-1}\log 7000 + 24B|G'|N\log N. \tag{27}$$

5.4 Estimating $\Xi_2$, $\Xi_3$ and Completing the Proof

Item (iv) of Proposition 4.7 together with (22) implies that

$$\Xi_2 \le B|G'|N\log 5900 + 12B|G'|N\log N. \tag{28}$$

Further, let $\lambda$ be from item (ii) of Proposition 4.7. Then $h(\lambda) \le 12B|G'|N\log 2$, because $h(1 - \zeta) \le \log 2$ for a root of unity $\zeta$. For $v \in S_3$ the number $j(P)$ is a $v$-adic integer. Hence so is the number $\lambda w(P)$. It follows that $|w(P)|_v \le |\lambda^{-1}|_v$ for $v \in S_3$, and

$$\Xi_3 \le d^{-1}\sum_{v \in S_3} d_v\log|\lambda^{-1}|_v \le h(\lambda^{-1}) = h(\lambda) \le 12B|G'|N\log 2.$$

Combining this with (23), (27) and (28), we obtain $h(P) \le 36B|G'|N^2\log 2N$, which, together with (21), implies the bound of Theorem 1.2 with $|G|/2$ replaced by $|G'|$.

If $S = M_K^\infty$ then in the second sum in (24) one can replace $v \in M_K$ by $v \in M_K^\infty$. Hence in (25), (26) and (27) one may replace 24 by 12, which allows $h(P) \le 24B|G'|N^2\log 3N$. This completes the proof of the theorem. □
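
The final combination of constants can be checked numerically. Adding (27), (28) and the bound for $\Xi_3$ and using (23) gives $h(P) \le \log 7000 + X(36\log N + \log 5900 + 12\log 2)$ with $X = B|G'|N^2$, and the same expression with 36 replaced by 24 when $S = M_K^\infty$. The Python sketch below (ours; the function names are hypothetical) verifies that these quantities stay below $36X\log 2N$ and $24X\log 3N$ respectively on a small grid of parameters with $N \ge 2$.

```python
import math


def general_gap(N, G_prime, B):
    """Claimed bound 36*X*log(2N) minus the bound obtained from (23), (27), (28)."""
    X = B * G_prime * N * N
    derived = math.log(7000) + X * (36 * math.log(N) + math.log(5900) + 12 * math.log(2))
    return 36 * X * math.log(2 * N) - derived


def archimedean_gap(N, G_prime, B):
    """Same comparison when S consists of the archimedean places (24 replaced by 12)."""
    X = B * G_prime * N * N
    derived = math.log(7000) + X * (24 * math.log(N) + math.log(5900) + 12 * math.log(2))
    return 24 * X * math.log(3 * N) - derived
```

Both gaps are positive already for $N = 2$, $|G'| = B = 1$, and they grow with $X$.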

6 Special Case: the Split Cartan Group 

When $G$ is a particular group, one can usually obtain a stronger result than in general. For instance in [2] we examined the case when $N = p$ is a prime number, $G$ is the normalizer of a split Cartan subgroup of $\mathrm{GL}_2(\mathbb{Z}/p\mathbb{Z})$ and $K = \mathbb{Q}$. In this case we obtained the estimate $h(P) \le C\sqrt{p}$ for $P \in Y_G(\mathbb{Z})$, with an absolute constant $C$. In this section we consider the case when $N = p$ is a prime number, $G$ is the Cartan subgroup itself, and $K$ is a quadratic field. (Note that the fact that we are no longer working over $\mathbb{Q}$ makes a significant difference with the normalizer-of-Cartan case studied in [2] (appearance of the $\Xi_2$-term), and requires much of the generality of the first part of the present work.) Without loss of generality we may assume that $G$ is the diagonal subgroup of $\mathrm{GL}_2(\mathbb{F}_p)$. The modular curve corresponding to this subgroup is denoted by $X_{\mathrm{sp.C}}(p)$. It parametrizes geometric isomorphism classes $(E, (A, B))$ of elliptic curves endowed with an ordered pair of $p$-isogenies. There is an isomorphism $\phi : X_0(p^2) \to X_{\mathrm{sp.C}}(p)$ over $\mathbb{Q}$ defined functorially as

$$(E, A_{p^2}) \mapsto \bigl(E/pA_{p^2},\ (A_{p^2}/pA_{p^2},\ E[p]/pA_{p^2})\bigr).$$

On the Poincaré upper half-plane $\mathcal{H}$, the map $\phi$ is induced by the map $\tau \mapsto p\tau$.

The homographic action of the matrix $\left(\begin{smallmatrix} 0 & -1 \\ 1 & 0 \end{smallmatrix}\right)$ on $\mathcal{H}$ defines an involution of $X_{\mathrm{sp.C}}$, which modularly is $(E, (A, B)) \mapsto (E, (B, A))$.

Theorem 6.1 Let $p > 3$ be a prime number and $K$ a number field of degree at most 2. Then for any $P \in Y_{\mathrm{sp.C}}(p)(\mathcal{O}_K)$ we have $h(P) \le 24p\log 3p$.

Remark 6.2 With a little additional effort, one can obtain a stronger estimate $h(P) \le Cp$ (and probably even $h(P) \le C\sqrt{p}$) with an absolute constant $C$. However, the bound of Theorem 6.1 is easier to obtain and sufficient for our purposes.

The curve $X_{\mathrm{sp.C}}(p)$ has $p + 1$ cusps $c_\infty, c_0, c_1, \ldots, c_{p-1}$ corresponding, respectively, to the points $i\infty, 0, 1/p, \ldots, (p-1)/p$ of $\mathcal{H}$. The morphism $X(p) \to X_{\mathrm{sp.C}}(p)$ is unramified at the cusps, and the morphism $X_{\mathrm{sp.C}} \to X(1)$ is ramified with index $p$ at all the cusps (as can be immediately seen by computing ramification indices), so the local parameter at every cusp $c$ of $X_{\mathrm{sp.C}}$ is

$$t_c = q_c^{1/p}.$$

The cusps $c_\infty$ and $c_0$ are defined over $\mathbb{Q}$. The cusps $c_1, \ldots, c_{p-1}$ are defined over the cyclotomic field $\mathbb{Q}(\zeta_p)$ and are conjugate over $\mathbb{Q}$. Thus the set $C(G, \mathbb{Q})$ consists of 3 elements, and the group generated by the principal divisors $(w_a)$ is of rank 2 by Proposition 4.5, item (v). Moreover, it is clearly generated by the divisors $(w_a)$ with $a \in \{(1/p, 0), (0, 1/p)\}$. More precisely, we have the following.

Proposition 6.3 The principal divisors $(w_a)$ with $a \in \{(1/p, 0), (0, 1/p)\}$ are

$$(w_{(1/p,0)}) = -\tfrac{1}{2}p(p-1)^2\bigl(c_\infty - pc_0 + c_1 + \cdots + c_{p-1}\bigr), \tag{29}$$

$$(w_{(0,1/p)}) = -\tfrac{1}{2}p(p-1)^2\bigl(-pc_\infty + c_0 + c_1 + \cdots + c_{p-1}\bigr). \tag{30}$$

Proof Denote by $p^{-1}\mathbb{F}_p^\times$ the set of non-zero elements of $p^{-1}\mathbb{Z}/\mathbb{Z}$. The $G'$-orbits of $(1/p, 0)$ and $(0, 1/p)$ are $\{(a, 0) : a \in p^{-1}\mathbb{F}_p^\times/\pm 1\}$ and $\{(0, a) : a \in p^{-1}\mathbb{F}_p^\times/\pm 1\}$ respectively, each element of each orbit occurring exactly $p - 1$ times. Since the morphism $X(p) \to X_{\mathrm{sp.C}}$ is unramified at the cusps, we have

$$\mathrm{ord}_{c_\infty} w_{(1/p,0)} = (p-1)\sum_{a \in p^{-1}\mathbb{F}_p^\times/\pm 1} \mathrm{ord}_{c_\infty} u_{(a,0)} = 12p^2(p-1)\sum_{a \in p^{-1}\mathbb{F}_p^\times/\pm 1} \ell_{(a,0)}$$

by item (iii) of Proposition 4.2. It follows that

$$\mathrm{ord}_{c_\infty} w_{(1/p,0)} = 3p^2(p-1)\sum_{k=1}^{p-1} B_2\!\left(\frac{k}{p}\right) = -\frac{1}{2}p(p-1)^2,$$

where we use the identity

$$\sum_{k=1}^{N-1} B_2\!\left(\frac{k}{N}\right) = -\frac{N-1}{6N}$$

for any positive integer $N$. In a similar fashion,

$$\mathrm{ord}_{c_\infty} w_{(0,1/p)} = 12p^2(p-1)\sum_{a \in p^{-1}\mathbb{F}_p^\times/\pm 1} \ell_{(0,a)} = 3p^2(p-1)^2B_2(0) = \frac{1}{2}p^2(p-1)^2.$$

Further, the involution induced by the matrix defined before Theorem 6.1 exchanges the cusps $c_0$ and $c_\infty$ and the units $w_{(1/p,0)}$ and $w_{(0,1/p)}$. It follows that

$$\mathrm{ord}_{c_0} w_{(1/p,0)} = \mathrm{ord}_{c_\infty} w_{(0,1/p)} = p^2(p-1)^2/2, \qquad \mathrm{ord}_{c_0} w_{(0,1/p)} = \mathrm{ord}_{c_\infty} w_{(1/p,0)} = -p(p-1)^2/2.$$

Finally, the Galois conjugation over $\mathbb{Q}$ shows that

$$\mathrm{ord}_{c_1} w_a = \cdots = \mathrm{ord}_{c_{p-1}} w_a,$$

which, since a principal divisor has degree $0$, implies that $\mathrm{ord}_{c_k} w_a = -p(p-1)^2/2$ for $k \in \{1, \ldots, p-1\}$. This proves the proposition. □
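
The Bernoulli identity and the resulting orders can be verified with exact rational arithmetic. The Python sketch below is ours and only illustrates the computation; `B2`, `bernoulli_sum` and `orders` are hypothetical helper names. It uses $B_2(x) = x^2 - x + \frac{1}{6}$ and checks that the divisor (29) has degree $0$, as any principal divisor must.

```python
from fractions import Fraction


def B2(x):
    """Second Bernoulli polynomial B_2(x) = x^2 - x + 1/6."""
    return x * x - x + Fraction(1, 6)


def bernoulli_sum(N):
    """Sum of B_2(k/N) for k = 1, ..., N-1."""
    return sum((B2(Fraction(k, N)) for k in range(1, N)), Fraction(0))


def orders(p):
    """Orders of w_(1/p,0) at c_infinity, c_0 and at each of c_1, ..., c_{p-1}."""
    at_infinity = 3 * p * p * (p - 1) * bernoulli_sum(p)
    # By the involution, ord at c_0 of w_(1/p,0) equals ord at c_infinity of w_(0,1/p).
    at_zero = 3 * p * p * (p - 1) ** 2 * B2(Fraction(0))
    # Degree 0 and Galois conjugacy determine the order at the remaining p-1 cusps.
    at_other = -(at_infinity + at_zero) / (p - 1)
    return at_infinity, at_zero, at_other
```

For $p = 5$ this gives the orders $-40$, $200$ and $-40$, in agreement with (29).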

Proof of Theorem 6.1 We have $S = M_K^\infty$, the set of the archimedean places of the quadratic field $K$. In particular, $s = |S|$ is 1 or 2. Fix $P \in Y_{\mathrm{sp.C}}(p)(\mathcal{O}_K)$. We use the notation $S_i$ and $\Xi_i$ of Subsection 5.2; in the present context this means that $S_2 = S \setminus S_1$ and $S_3 = M_K^0$. We again pick for every $v \in S_1$ a $v$-nearby¹ cusp $c_v$ and set $\Sigma = \{c_v : v \in S_1\}$. The set $\Sigma$ has at most two elements, and we have one of the following three possibilities:

$$\Sigma \subset \{c_\infty, c_1, \ldots, c_{p-1}\}, \tag{31}$$

$$\Sigma \subset \{c_0, c_1, \ldots, c_{p-1}\}, \tag{32}$$

$$\Sigma \subset \{c_\infty, c_0\}. \tag{33}$$

We define $w = w_{(1/p,0)}^{-1}$ in the case (31), $w = w_{(0,1/p)}^{-1}$ in the case (32) and $w = w_{(1/p,0)}w_{(0,1/p)}$ in the case (33). With the notation of Subsection 4.3 and Proposition 4.7 we have

$$N = p, \qquad |G'| = |G|/2 = (p-1)^2/2, \qquad B = \begin{cases} 1 & \text{in the cases (31) and (32)}, \\ 2 & \text{in the case (33)}. \end{cases}$$



Now we argue as in Subsection 5.3 with one very significant distinction: instead of the estimate $(\mathrm{ord}_{c_v} w)/e_{c_v} \ge N^{-1}$ we use the identity

$$\frac{\mathrm{ord}_{c_v} w}{e_{c_v}} = \begin{cases} (p-1)^2/2 & \text{in the cases (31) and (32)}, \\ (p-1)^3/2 & \text{in the case (33)}. \end{cases}$$

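In the cases (31) and (32) we obtain

$$\begin{aligned}
\Xi_1 &\le d^{-1}\sum_{v \in S_1} d_v\frac{\mathrm{ord}_{c_v} w}{e_{c_v}}\log|q_{c_v}(P)|_v + \frac{1}{2}(p-1)^2d^{-1}\sum_{v \in M_K^\infty} d_v\rho_v \\
&= \frac{1}{2}(p-1)^2d^{-1}\sum_{v \in S_1} d_v\log|q_{c_v}(P)|_v + 6(p-1)^2p\log p \\
&\le -\frac{1}{2}(p-1)^2d^{-1}\sum_{v \in S_1} d_v\log|j(P)|_v + \frac{1}{2}(p-1)^2\log 2 + 6(p-1)^2p\log p \\
&\le -\frac{1}{2}(p-1)^2d^{-1}\sum_{v \in M_K} d_v\log^+|j(P)|_v + \frac{1}{2}(p-1)^2\log 7000 + 6(p-1)^2p\log p \\
&= -\frac{1}{2}(p-1)^2h(P) + \frac{1}{2}(p-1)^2\log 7000 + 6(p-1)^2p\log p.
\end{aligned} \tag{34}$$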



In the case (33) a similar calculation gives

$$\Xi_1 \le -\frac{1}{2}(p-1)^3h(P) + \frac{1}{2}(p-1)^3\log 7000 + 12(p-1)^2p\log p. \tag{35}$$



We estimate $\Xi_2$ exactly as in Subsection 5.4:

$$\Xi_2 \le \frac{B}{2}(p-1)^2p\log 5900 + 6B(p-1)^2p\log p. \tag{36}$$

Further, in the cases (31) and (32) we may take $\lambda = (1 - \zeta_p)^{6p(p-1)^2}$, which is equal to $p^{6p(p-1)}$ times a unit. And in the case (33) we may take $\lambda = 1$. We obtain

$$\Xi_3 \le \begin{cases} 6(p-1)p\log p & \text{in the cases (31) and (32)}, \\ 0 & \text{in the case (33)}. \end{cases}$$

Combining all the previous estimates, we obtain $h(P) \le 24p\log 3p$ in the cases (31) and (32), and in the case (33) we obtain a much sharper estimate $h(P) \le 72\log 3p$. This proves the theorem. □
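
The last step can be checked numerically. Dividing $\Xi_1 + \Xi_2 + \Xi_3 = 0$ by $\frac{1}{2}(p-1)^2$ in the cases (31) and (32), and by $\frac{1}{2}(p-1)^3$ in the case (33), the estimates (34), (35), (36) and the bound for $\Xi_3$ give the two expressions computed below. This Python sketch is ours and not part of the paper; the function names and the search range are arbitrary choices.

```python
import math


def primes_up_to(n):
    """All primes <= n, by the sieve of Eratosthenes."""
    sieve = [True] * (n + 1)
    sieve[0:2] = [False, False]
    for i in range(2, int(n ** 0.5) + 1):
        if sieve[i]:
            sieve[i * i :: i] = [False] * len(range(i * i, n + 1, i))
    return [i for i, is_prime in enumerate(sieve) if is_prime]


def derived_bound_31_32(p):
    """Bound for h(P) from (34), (36) with B = 1 and Xi_3 <= 6(p-1)p log p."""
    return (math.log(7000) + 24 * p * math.log(p) + p * math.log(5900)
            + 12 * p * math.log(p) / (p - 1))


def derived_bound_33(p):
    """Bound for h(P) from (35), (36) with B = 2 and Xi_3 <= 0."""
    return math.log(7000) + (2 * p / (p - 1)) * (24 * math.log(p) + math.log(5900))
```

For every odd prime in the tested range the first expression is below $24p\log 3p$ and the second is below $72\log 3p$.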



¹ Since $v$ is archimedean, we can write here "the $v$-nearby cusp".

7 An Application: Q-curves of Prime Power Degree



In this section we apply Theorem 6.1 to the study of $\mathbb{Q}$-curves, proving Theorem 1.3. Let us recall some definitions. For a positive integer $N$ with prime decomposition $N = p_1^{a_1} \cdots p_k^{a_k}$, we set $X_0^*(N) := X_0(N)/\langle w_{p_i} \rangle$, where $\langle w_{p_i} \rangle$ is the group of automorphisms of $X_0(N)$ spanned by the Atkin-Lehner involutions $w_{p_i}$, and $X_0^+(N) := X_0(N)/w_N$. If $K$ is a number field, it is a theorem of Elkies [5] that any $K$-curve of degree $N$ gives rise to a point in $X_0^*(N)(K)$. The curve $X_0^+(N)$ ($= X_0^*(N)$ if $N$ is a prime power) parameterizes quadratic $K$-curves of degree $N$ (that is, elliptic curves defined over a quadratic extension of $K$ and admitting a cyclic isogeny of degree $N$ to the $K$-conjugate curve).

Theorem 1.3 is a consequence of the following three statements.

Theorem 7.1 For a prime $p > 37$ and $r > 1$, the rational noncuspidal points of $X_0^+(p^r)$ are integral; that is, the $j$-invariant $j(P)$ of any lift $P \in X_0(p^r)(\mathbb{C})$ of any non-cuspidal point of $X_0^+(p^r)(\mathbb{Q})$ belongs to $\overline{\mathbb{Z}}$.

Theorem 7.2 For every $d$ there is a positive number $\kappa(d)$ such that the following holds. Let $E$ be a non-CM elliptic curve defined over a number field $K$ of degree $d$, and admitting a cyclic isogeny over $K$ of degree $\delta$. Then $\delta \le \kappa(d)(1 + h(j_E))^2$.

Theorem 7.3 Let $p > 3$ be a prime number, $K$ be a quadratic number field, $r > 1$ an integer, and $P$ a point of $Y_0(p^r)(\mathcal{O}_K)$. Then $h(P) \le 110p\log p$.

Proof of Theorem 1.3 (assuming Theorems 7.1, 7.2 and 7.3) Existence of the degeneracy morphisms $X_0^+(p^{r+2}) \to X_0^+(p^r)$ over $\mathbb{Q}$ (see, for instance, [18]) shows it is enough to prove the result for $r = 2$ and $r = 3$. The case $r = 2$, where $X_0^+(p^2)$ is isomorphic to $X_{\mathrm{split}}(p)$, was settled in [2], so we are left with the case $r = 3$.

Thus, let $P \in X_0^+(p^3)(\mathbb{Q})$ be a non-cuspidal and non-CM point, and $Q \in X_0(p^3)(K)$ a lift of it, with values in a quadratic number field $K$. From Theorem 7.1 we know that $Q$ is an integral point if $p$ is large enough. Theorem 7.3 implies that $h(Q) \le 110p\log p$.

Call $E$ the elliptic curve associated to $Q$. It is a non-CM elliptic curve admitting a cyclic isogeny of degree $p^3$ over $K$. Theorem 7.2 implies that $p^3 \le C(1 + h(Q))^2$ with an absolute constant $C$. Therefore $p^3 \le C'(p\log p)^2$ with another absolute constant $C'$, that is, $p \le C'(\log p)^2$, and $p$ is bounded. □

Theorem 7.1 will be proved in Section 8. It is a generalization of the one used in [5] when $r = 2$, which in that case was originally due to Mazur, Momose and Merel. Theorem 7.2 is a straightforward consequence of the isogeny bounds due to Masser and Wüstholz [13] and Pellarin [20]. See [2, Corollary 5.4] for the details.

Theorem 7.3 is deduced below from Theorem 6.1 and the following lemma.

Lemma 7.4 Let $E$, $E'$ be elliptic curves defined over some number field and linked by an isogeny of degree $\delta$. Then

$$|h(j_E) - h(j_{E'})| \le 13\log(1 + h(j_E)) + 7\log\delta + 100.$$

Proof Denote by $h_{\mathcal{F}}(E)$ the Faltings semistable height of the elliptic curve $E$. Recall that $h_{\mathcal{F}}(E)$ is defined as $[K : \mathbb{Q}]^{-1}\deg\omega$, where $K$ is a number field such that $E$ has semi-stable reduction at every place of $K$, and $\omega$ is a Néron differential on $E|_K$; it is independent of the choice of $K$ and $\omega$. A result of Faltings [6, Lemma 5] implies that

$$|h_{\mathcal{F}}(E) - h_{\mathcal{F}}(E')| \le \frac{1}{2}\log\delta.$$

Further, for any elliptic curve $E$ over a number field we have

$$|h(j_E) - 12h_{\mathcal{F}}(E)| \le 6\log(1 + h(j_E)) + C,$$

with an absolute constant $C$, see [22, Proposition 2.1]. Pellarin shows that one can take $C = 47.15$, see [20, equation (51) on page 240]. Combining all this, we find

$$|h(j_E) - h(j_{E'})| \le 6\log(1 + h(j_E)) + 6\log(1 + h(j_{E'})) + 6\log\delta + 95,$$

which implies the result after a routine calculation. □



Proof of Theorem 7.3 We may assume $r = 2$. Let $\phi : X_0(p^2) \to X_{\mathrm{sp.C}}(p)$ be the isomorphism defined in the beginning of Section 6. Then the elliptic curve associated to a point $P$ on $X_0(p^2)$ is $p$-isogenous to the curve associated to the point $P' = \phi(P)$ on $X_{\mathrm{sp.C}}(p)$. Lemma 7.4 implies that

$$|h(P) - h(P')| \le 13\log(1 + h(P')) + 7\log p + 100. \tag{37}$$

Since $P \in Y_0(p^2)(\mathcal{O}_K)$ and good reduction is preserved under isogeny, $P'$ belongs to $Y_{\mathrm{sp.C}}(p)(\mathcal{O}_K)$ as well. Applying Theorem 6.1 to $P'$, we find $h(P') \le 24p\log 3p$, which, combined with (37), implies the result. □
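
The arithmetic behind the last sentence can be checked numerically. Since $x \mapsto x + 13\log(1 + x)$ is increasing, (37) and $h(P') \le 24p\log 3p$ give $h(P) \le H + 13\log(1 + H) + 7\log p + 100$ with $H = 24p\log 3p$. The Python sketch below (ours; the function names and the tested range are arbitrary) confirms that this quantity does not exceed $110p\log p$ for the odd primes tested.

```python
import math


def primes_up_to(n):
    """All primes <= n, by the sieve of Eratosthenes."""
    sieve = [True] * (n + 1)
    sieve[0:2] = [False, False]
    for i in range(2, int(n ** 0.5) + 1):
        if sieve[i]:
            sieve[i * i :: i] = [False] * len(range(i * i, n + 1, i))
    return [i for i, is_prime in enumerate(sieve) if is_prime]


def height_bound_on_X0(p):
    """Upper bound for h(P) obtained from (37) and h(P') <= 24 p log(3p)."""
    H = 24 * p * math.log(3 * p)
    return H + 13 * math.log(1 + H) + 7 * math.log(p) + 100
```

The margin is smallest at $p = 3$, where the left-hand side is about $332$ and $110p\log p$ is about $363$.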



8 Integrality of $X_0^+(p^3)(\mathbb{Q})$

We show that rational points on $X_0^+(p^3)$ are, in fact, integral.

Theorem 8.1 For a prime $p > 37$, and $P \in X_0^+(p^3)(\mathbb{Q})$ a non-cuspidal point, the $j$-invariant of any lift of $P$ to $X_0(p^3)(\mathbb{C})$ belongs to $\overline{\mathbb{Z}}$.

The proof of this theorem is an adaptation of the one we proposed in [2], relying on results and observations of Mazur, Momose and Merel. Actually, Theorem 8.1 was already proven, except integrality at 2 when $p \equiv 1 \bmod 8$, by Momose in [18, Theorem 3.8]; in the present paper, however, we do need the whole statement of Theorem 8.1. The theorem is probably true for $p > 11$, $p \ne 13$ (some cases are indeed given by Momose in loc. cit.); but our assumption that $p > 37$ simplifies our arguments.

If $M$, $N$ are natural integers and $M$ is a divisor of $N$, we write $\pi_{N,M} : X_0(N) \to X_0(M)$ for the degeneracy morphism which is defined functorially as $(E, A_N) \mapsto (E, A_M)$, where $A_M := E[M] \cap A_N$. If $M$ and $N/M$ are relatively prime, we write $w_M$ for the corresponding Atkin-Lehner involution; recall that $X_0^+(N) := X_0(N)/w_N$. As usual, we write $J_0(N)$ for the jacobian over $\mathbb{Q}$ of $X_0(N)$, and $J_0^-(N) := J_0(N)/(1 + w_N)J_0(N)$. Models over rings of integers for abelian varieties will be Néron models. Recall that, in this paper, the model for $X_0(N)$ over $\mathbb{Z}$ that we consider is the modular one. Models for those modular curves over arbitrary schemes will be deduced by base change. We denote by $X_0(N)^{\mathrm{sm}}_{/\mathbb{Z}}$ the smooth locus of $X_0(N)_{/\mathbb{Z}}$ (obtained, when $N = p$ is prime, by removing the (super)singular points in the fiber at $p$).

As already mentioned in Section 6, the curve $X_{\mathrm{sp.C}}(N)$ parametrizes elliptic curves endowed with an ordered pair of independent $N$-isogenies. With "ordered" replaced by "unordered", the same is true (at least when $N$ is a prime power) for the curve $X_{\mathrm{split}}(N)$. For each prime $p$ dividing $N$ there is an involution on $X_{\mathrm{sp.C}}(N)$, here also denoted by $w_p$, defined functorially by




so that $X_{\mathrm{split}}(N) = X_{\mathrm{sp.C}}(N)/\langle w_p : p \mid N \rangle$. The map $z \mapsto Nz$ on the upper half-plane defines the $\mathbb{Q}$-isomorphism $\phi : X_0(N^2) \simeq X_{\mathrm{sp.C}}(N)$ of Section 6, inducing an isomorphism

$$X_{\mathrm{split}}(N) \simeq X_0(N^2)/\langle w_p : p \mid N \rangle$$

on the quotients.

We recall certain properties of the modular Jacobian $J_0(p)$ and its Eisenstein quotient $J(p)$ (see [14]).

Proposition 8.2 Let $p$ be a prime number. Then we have the following.

(i) [14, Theorem 1] The group $J_0(p)(\mathbb{Q})_{\mathrm{tors}}$ is cyclic and generated by $\mathrm{cl}(0 - \infty)$, where $0$ and $\infty$ are the cusps of $X_0(p)$. Its order is equal to the numerator of the quotient $(p-1)/12$.

(ii) [14, Theorem 4] The group $J(p)(\mathbb{Q})$ is finite. Moreover, the natural projection $J_0(p) \to J(p)$ defines an isomorphism $J_0(p)(\mathbb{Q})_{\mathrm{tors}} \to J(p)(\mathbb{Q})$.

Remark 8.3 As Mazur notices, Raynaud's theorem on group schemes of type $(p, \ldots, p)$ ensures that $J_0(p)(\mathbb{Q})_{\mathrm{tors}}$ defines a $\mathbb{Z}$-group scheme which, being constant in the generic fiber, is constant outside 2, and which at 2 has étale quotient of rank at least half that of $J_0(p)(\mathbb{Q})_{\mathrm{tors}}$.

For a point $Q \in X_0^+(p^3)(\mathbb{Q})$ and $z \in X_0(p^3)(K)$ a lifting of $Q$ with $K$ a quadratic number field, the point $z$ corresponds to a couple $(E, C_{p^3})$ over $K$, by [1, Proposition VI.3.2]. Set $\pi := \pi_{p^3,p}$, $x := w_p\pi(z)$ and $x_0 := \pi w_{p^3}(z) \in X_0(p)(K)$. Writing $D_p := p^2C_{p^3}$, the modular interpretation of $x$ and $x_0$ is therefore $(E/D_p,\ E[p] \bmod D_p)$ and $(E/C_{p^3},\ E[p] + C_{p^3} \bmod C_{p^3})$ respectively. For $t$ an element in the $\mathbb{Z}$-Hecke algebra for $\Gamma_0(p)$, define the morphism $g_t$ from $X_0(p^3)^{\mathrm{sm}}_{/\mathbb{Z}}$ to $J_0(p)_{/\mathbb{Z}}$, which extends by the universal property of Néron models the morphism on generic fibers:

$$g_t : \begin{cases} X_0(p^3) \to J_0(p) \\ Q \mapsto t \cdot \mathrm{cl}\bigl(w_p\pi(Q) - \pi w_{p^3}(Q)\bigr). \end{cases}$$

Let $\Pi : J_0(p) \to J(p)$ be the projection to the Eisenstein quotient, and $\tilde g_t := \Pi \circ g_t$.

Lemma 8.4 The morphism $\tilde g_t$ factorizes through a $\mathbb{Q}$-morphism $\tilde g_t^+$ from $X_0^+(p^3)$ to $J(p)$. If $t \cdot (1 + w_p) = 0$, the same is true for $g_t$ and we similarly denote by $g_t^+ : X_0^+(p^3) \to J_0(p)$ the factor morphism.

Proof We compute that, in $J_0(p)$:

$$\mathrm{cl}\bigl((w_p\pi(z)) - (\pi w_{p^3}(z)) - (w_p\pi w_{p^3}(z)) + (\pi(z))\bigr) = (1 + w_p)\,\mathrm{cl}\bigl((\pi(z)) - (\pi w_{p^3}(z))\bigr),$$

from which we derive the second assertion when $t(1 + w_p) = 0$. As for the first statement, we know that $(1 + w_p)$ acts trivially on $J(p)$ from [14, Proposition 17.10]. □

By the universal property of Néron models, we may extend $g_t^+$ and $\tilde g_t^+$ to maps from $X_0^+(p^3)^{\mathrm{sm}}_{/\mathbb{Z}}$ to $J_0(p)_{/\mathbb{Z}}$ and $J(p)_{/\mathbb{Z}}$, respectively. We still denote those extended morphisms in the same way and we put $g^+ = \tilde g_1^+$.

Proposition 8.5 Let $P \in X_0^+(p^3)(\mathbb{Q})$ for some $p > 37$, let $x$, $x_0$ and $K$ be defined as above, and let $\mathcal{O}_K$ be the ring of integers of $K$. Then:

(i) The isogeny class of elliptic curves associated to $P$ is not potentially supersingular at $p$.

(ii) The points $x$ and $x_0$ coincide in the fibers of characteristic $p$ of $X_0(p)_{/\mathcal{O}_K}$.

Proof Point (i) is Lemma 2.2 (ii) together with Theorem 3.2 of [18]. Point (ii) is proved in Proposition 3.1 of [18]. □

Proof of Theorem 8.1 Let $P$ be a non-cuspidal point on $X_0^+(p^3)$ with values in $\mathbb{Q}$, and $Q$ a lift of $P$ to $X_0(p^3)(\overline{\mathbb{Q}})$. If $\mathcal{L}$ is a finite place of $\mathcal{O}_K$ dividing the denominator of $j(Q)$, then $Q$ specializes to a cusp at $\mathcal{L}$. Recall that $X_0(p^3)$ has two cusps defined over $\mathbb{Q}$, and two other Galois orbits of cusps, with fields of definition $\mathbb{Q}(\zeta_p)^+$ and $\mathbb{Q}(\zeta_{p^2})^+$ respectively. We first claim that $Q$ specializes to one of the rational cusps (which, by changing our lift of $P$, may be assumed to be the $\infty$-cusp, as $w_p$ switches the rational cusps). Indeed, it follows from Proposition 8.5 (ii) that $g^+(P)(\mathbb{F}_p) = 0(\mathbb{F}_p)$, and by the remark after Proposition 8.2, $g^+(P)(\mathbb{Q}) = 0(\mathbb{Q})$ (recall $p > 2$). The non-rational cusps of $X_0(p^3)(\mathbb{C})$ map to $\mathrm{cl}(0 - \infty)$ in $J_0(p)(\mathbb{C})$ (indeed, $w_{p^3}$ preserves each non-rational Galois orbit of cusps; for more details see for instance the proof of [11, Proposition 2.5]). Therefore, as $p > 37$, Proposition 8.2 implies that if $Q$ specializes to a non-rational cusp at $\mathcal{L}$ then $g^+(P)$ would not be $0$ at the characteristic $\ell$ of $\mathcal{L}$, a contradiction.

Now let $\mathcal{I}$ be the ideal of the Hecke algebra such that $J(p) = J_0(p)/\mathcal{I}J_0(p)$. Choose an $\ell$-adically maximal element $t \ne 0$ in the Hecke algebra such that $t \cdot \mathcal{I} = 0$. Again, as $t(1 + w_p) = 0$, the morphism $g_t$ factorizes through a morphism $g_t^+ : X_0^+(p^3)^{\mathrm{sm}}_{/\mathbb{Z}} \to t \cdot J_0(p)_{/\mathbb{Z}}$. Moreover $g_t^+(P)$ belongs to $t \cdot J_0(p)(\mathbb{Q})$, hence is a torsion point, as $t \cdot J_0(p)$ is isogenous to a quotient of $J(p)$. We see as above by looking at the fiber at $p$ that $g_t^+(P) = 0$ at $p$, hence generically, because of Proposition 8.2 (or more generally by Raynaud's well-known result on group schemes of type $(p, \ldots, p)$ on a not-too-ramified discrete valuation ring). We then easily check by using the $q$-expansion principle, as in [16, Theorem 5], that $g_t^+$ is a formal immersion at the specialization $\infty(\mathbb{F}_\ell)$ of the rational cusp $\infty$ on $X_0^+(p^3)$. This allows us to apply the classical argument of Mazur (see e.g. [15, proof of Corollary 4.3]), yielding a contradiction with the fact that $P$ is not generically cuspidal. Therefore $P$ is not cuspidal at $\mathcal{L}$. □